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Abstract 

We develop a theory of Darboux transformations for CMV matrices, canonical repre¬ 
sentations of the unitary operators. In perfect analogy with their self-adjoint version the 
Darboux transformations of Jacobi matrices - they are equivalent to Laurent polynomial 
modifications of the underlying measures. We address other questions which emphasize 
the similarities between Darboux transformations for Jacobi and CMV matrices, like their 
(almost) isospectrality or the relation that they establish between the corresponding or¬ 
thogonal polynomials, showing also that both transformations are connected by the Szego 
mapping. 

Nevertheless, we uncover some features of the Darboux transformations for CMV 
matrices which are in striking contrast with those of the Jacobi case. In particular, when 
applied to CMV matrices, the matrix realization of the inverse Darboux transformations 
- what we call ‘Darboux transformations with parameters’ - leads to spurious solutions 
whose interpretation deserves future research. Such spurious solutions are neither unitary 
nor band matrices, so Darboux transformations for CMV matrices are much more subject 
to the subtleties of the algebra of infinite matrices than their Jacobi counterparts. 

A key role in our theory is played by the Cholesky factorizations of infinite matrices. 
Actually, the Darboux transformations introduced in this paper are based on the Cholesky 
factorizations of degree one Hermitian Laurent polynomials evaluated on CMV matrices. 

These transformations are also generalized to higher degree Laurent polynomials, as well 
as to the extension of CMV matrices to quasi-definite functionals - what we call ‘quasi- 
CMV’ matrices. 

Furthermore, we show that this CMV version of Darboux transformations plays a role 
in integrable systems like the Schur flows or the Ablowitz-Ladik model which parallels 
that of Darboux for Jacobi matrices in the Toda lattice. 
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1 Introduction 

Darboux transformations, originally tied to Schrodinger operators [14] [15] 43] (44, [45l 
mm [68] . have become a powerful tool in many areas of mathematical physics (see 
[36,00] [59,161 1 (77 1 78] and references therein). For instance, the role of ID Schrodinger 
operators in the Lax pair of the KdV equation led to export Darboux transformations to 
KdV and generalizations [59]. The discrete version of these relations links Darboux for 
Jacobi matrices (discrete ID Schrodinger) and the Toda lattice (discrete KdV) [77], 78] . 

Darboux transformations of Jacobi matrices are at the center of a rich interplay 
among integrablc systems, orthogonal polynomials, bispectral problems and numerical 
linear algebra 0(610 Ml EU M, (2H1 HEHS! EH M, EH EUSH EH (H (75] EH (HU [85] . These 
topics are also closely related to the unitary counterpart of Jacobi, the CMV matrices, 
which date back to works on the unitary eigenproblem [H [8[ [81], a decade before their 
rediscovery in the context of orthogonal polynomials on the unit circle [nUTOUTT]. It 
was later realized that CMV matrices provide the Lax pair of integrablc systems known 
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under the name of Schur flows (discrete mKdV and unitary analogue of Toda) and 
Ablowitz-Ladik (discrete nonlinear Schrodinger) [2J [3] [22], [30] [32] [52] [5U [63] EH 1651 173] . 
Yet, Darboux has not been applied to CMV matrices so far, the closest precedents being 
on related issues for isometric Hessenberg matrices [T6] 125, 2 T, EH 421 ED]. 

This paper develops a theory of Darboux transformations for CMV matrices which 
intends to be useful for applications which parallel those of the Jacobi case. Indeed, we 
will see that this extension of Darboux has a role in Ablowitz-Ladik and Schur flows 
which mimics that of Darboux for Jacobi in Toda: Darboux for CMV is an integrable 
discretization of such flows which also generates new flows from known ones. Besides, 
the success of Darboux in Jacobi bispectral problems suggests that Darboux for CMV 
may open new ways in the search for bispectral situations on the unit circle, where it 
seems difficult to escape from trivial instances. Further, the recent discovery that CMV 
drives the evolution of ID quantum walks [9] (quantum version of random walks coming 
from the discrete ID Dirac equation mm) could add new uses of Darboux for CMV. 

Let us summarize the main ideas and results of the paper using the Jacobi case as 
a guide and counterpoint. 

The strategy for the extension of Darboux to CMV is to transform every CMV into 
a self-adjoint band matrix M to which we apply standard Darboux transformations 
based on triangular band factorizations followed by a commutation of the factors: M = 
AB —> N = BA , with A, B lower/upper triangular band matrices. 

Darboux is usually implemented on a Jacobi matrix via the LU factorization of 
a similar non-symmetric tridiagonal matrix. However, it is possible to rewrite these 
transformations by performing Cholesky factorizations ( B = A + , the adjoint of A) 
directly onto the Jacobi matrix, which avoids loosing the hermiticity in the process. We 
will use the Cholesky approach for the CMV version of Darboux because it permits a 
closer control of unitarity, a more intricate property than hermiticity. 

In the case of a Jacobi matrix J, the Cholesky factorization may require a previous 
real shift M = J + [31 (/ stands for the identity matrix) to deal with a positive 
definite matrix. As a natural CMV translation of this we will introduce, not only an 
eventual real shift, but also a Hermitian linear combination of a CMV C and its adjoint 
C + = C~ l mapping it into a self-adjoint band matrix M = aC + aC~ l + [31 . In other 
words, while Darboux for Jacobi involves a real polynomial p(x) = x + (3 evaluated on 
the Jacobi matrix, in the CMV case we will evaluate a Hermitian Laurent polynomial 
f[(z) = az + az -1 + [3 on the CMV matrix. 

Despite the above difference, we will discover that Darboux for Jacobi and CMV 
have a similar meaning since they are respectively equivalent to polynomial and Laurent 
polynomial modifications of the underlying measure. This is part of the first main result 
of the paper, Theorem 13.21 which yields several equivalences for Darboux transforma¬ 
tions of CMV matrices in perfect analogy with the Jacobi case. Thus, we will show that 
Darboux for Jacobi and CMV share many properties concerning not only the matrices, 
but also the related measures and orthogonal polynomials. Some of these properties are 
reflected in the following table of analogies, which also includes a summary of notations 
used along the paper. 
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Darboux for Jacobi 
with polynomial p(x) = x+/3 

Darboux for CMV 

with Laurent pol. £(z) = az + az -1 + j3 

Jacobi J *-h>- K. 

Measures on u AA u 
the real line y 

Orthogonal p* \p = ('Pn) 

polynomials y 1 \ q = (q n ) 

CMV C^V 

Measures on t 

the unit circle y 

Orthogonal t ix=(Xn) 

Laurent pol. A [cu = (u n ) 

K, = A~ 1 JA 

p{J) = AA + p(/C) = A + A 

A 2-band lower triangular 

V = A~ X CA 

£(C) = AA + £(V) = A + A 

A 3-band lower triangular 

R. 

R 

II 

R. 

-R 

du = £ dp 

P = Aq p n e span{g n _i, q n } 

pq = A + p pq n e span{p n ,p n+ i} 

x = Acu Xn e span{u; n _ 2 , w„_i, u n ) 

£uj = A + X £u n 6 spanjxn, Xn+ 1 , Xn+ 2 } 

Almost isospectral 
(isospectral up to at most 1 point) 

Almost isospectral 
(isospectral up to at most 2 points) 


The above similarities are not the only results supporting this version of Darboux 
for CMV as the unitary analogue of Darboux for Jacobi. What is more, both transfor¬ 
mations turn out to be directly related by a natural link between the real line and the 
unit circle, the so called Szego connection [76]. This relation is given in Theorem 15.21 
which can be also considered a central result of the paper. 

Having said that, major differences between Jacobi and CMV will appear in the 
matrix realization of the inverse Darboux transformations, M = AA + <— N = A + A. 
We refer to such a matrix realization as the Darboux transformations with parameters 
because they provide parametric solutions originated by the lack of uniqueness of the 
reversed Cholesky factorizations N = A + A. In the Jacobi case these parametric solu¬ 
tions coincide with the Jacobi matrices obtained by inverse Darboux and depend on 
a single real parameter. However, they depend on four real parameters in the CMV 
case, a symptom of a deeper dissimilarity between Jacobi and CMV: Darboux with 
parameters for CMV not only yields the solutions of inverse Darboux, but also leads to 
spurious solutions which are not CMV, not even band nor unitary. The origin of this 
difference with respect to Jacobi is that transforming a unitary matrix into a Hermitian 
one requires a Laurent polynomial with at least two zeros instead of a polynomial of de¬ 
gree one. This resembles the situation in the case of higher degree Darboux for Jacobi, 
based on transforming the Jacobi matrix by a polynomial of degree greater than one. 
Hence, the matrix implementation of inverse Darboux for CMV needs a constraint on 
the reversed Cholesky factorizations to select those leading to CMV solutions. This con¬ 
straint is given in Theorem 14.31 the second main result of the paper, which summarizes 
equivalent ways of separating CMV and spurious solutions. 

The need to deal with spurious solutions which are neither unitary nor band matrices 
requires a careful manipulation of infinite matrices for the Darboux transformations 
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of CMV matrices. Properties like the associativity of matrix multiplication or the 
uniqueness of the inverse matrix may fail [13]. These troubles are not so evident for 
Jacobi matrices due to the absence of spurious solutions. Luckily enough, the situation 
in the CMV case will be successfully handled thanks to the lower Hessenberg type 
structure of the spurious solutions (only finitely many upper diagonals are non-null). 
These reasons make it advisable to specify from the very beginning the kind of matrix 
operations that will be admissible and their properties. This is necessary to legitimate 
matrix manipulations, but also sheds light on some aspects of Darboux transformations 
which, even in the Jacobi case, are not usually explicitly addressed. 

Darboux transformations for CMV matrices can be compared with previous matrix 
transformations based on factorizations. A precedent of this is the QR algorithm for 
CMV matrices, equivalent to a special type of Laurent polynomial modifications of the 
orthogonality measure [SI]. Nevertheless, we will see that QR has a more limited appli¬ 
cability than Darboux, which can be considered as a way to extend the QR algorithm 
to general Laurent polynomial modifications of measures. 

Analogously to the Jacobi case, Darboux makes sense for higher degree Laurent 
polynomial transformations of CMV matrices, as well as for situations related to quasi- 
definite functionals, although this requires the generalization of Cholesky factorizations 
and CMV matrices beyond the positive definite case HU. 

The above ideas are developed in the paper according to the following schedule: 
Section [2] introduces the basic setting in which the rest of the paper will be conducted. 
It covers the explicit description of the algebra of infinite matrices that will be used, 
the analysis of the matrix representations of the multiplication operator with respect to 
ordered bases of Laurent polynomials (‘zig-zag’ bases) and general Darboux factoriza¬ 
tions for Hermitian Laurent polynomials evaluated on such matrix representations. A 
particularization of these factorizations leads to the Darboux transformations for CMV 
matrices in Section [3] whose main result, Theorem 13.21 gives several characterizations 
of such transformations. Inverse Darboux transformations and their matrix realization, 
the Darboux transformations with parameters, are accounted for in Section HJ It in¬ 
cludes the discussion about spurious solutions and the characterization of the CMV 
ones in Theorem 14.31 the central result. Section [5] compares Darboux for Jacobi and 
CMV, linking them via the Szego mapping in Theorem 15.21 and showing their relation 
with a new connection between the real line and the unit circle recently obtained [20]. 
In Section [H] we find a comparison between Darboux transformations and the QR al¬ 
gorithm for CMV matrices, which also serves to rewrite the former ones in operator 
language. Section [7] uncovers the close relation between Darboux for CMV and certain 
integrable systems, namely, the Schur flows and the Ablowitz-Ladik model. Higher 
degree Darboux transformations and the extension to quasi-dehnite functionals appear 
in Section [8j Section [9] summarizes the conclusions, pointing out some open problems 
suggested by the previous results. Finally, Appendix [A] deals with the existence and 
uniqueness of Cholesky factorizations for infinite matrices, crucial issues in the devel¬ 
opment of the paper. Some illustrative examples of Darboux transformations for CMV 
matrices can be found at the end of Sections [J [5] and 18.21 
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2 Hessenberg type matrices, zig-zag bases and Dar- 
boux factorizations 

The Darboux transformation of a Jacobi matrix J follows from a factorization of the 
symmetric matrix polynomial p(J) = J + /3I for some parameter /3el (see Section [5]). 
The translation of this idea to the case of a CMV matrix C requires the substitution of 
the polynomial p(x) = x + (3 by a Hermitian Laurent polynomial £(z) = az + /3 + aV _1 , 
a e C\{0}, f3 e M, so that the matrix Laurent polynomial £{C) = aC + f3I + aC” 1 
becomes self-adjoint too due to the unitarity of C. A suitable factorization of £{C) 
should provide the CMV version of the Darboux transformation. 

A central role in Darboux for Jacobi matrices is played by the orthogonal polynomials 
on the real line since their recurrence relation is codified by a Jacobi matrix. The 
corresponding CMV analogue are the orthogonal Laurent polynomials on the unit circle, 
which are expected to be an essential ingredient in Darboux for CMV. Among other 
things, the orthogonal polynomials give to Jacobi matrices the meaning of a matrix 
representation of a symmetric multiplication operator. This is also true for orthogonal 
Laurent polynomials and CMV matrices, but in this case the multiplication operator is 
unitary instead of symmetric. 

A special feature of Darboux for CMV matrices is the apparition of spurious transfor¬ 
mations involving non-CMV (actually, even non-unitary) matrices with no band struc¬ 
ture, which are related to non-orthogonal Laurent polynomials. Therefore, we will need 
to deal with general ordered sequences of Laurent polynomials (assuming no orthogo¬ 
nality property) and related matrix representations of a multiplication operator, which 
forces us to work in a general setting, avoiding any a priori assumption of a Hilbert 
space structure. This means that the space of functions that we will consider is simply 
the complex vector space of Laurent polynomials A = span {z n } ne z = C[z, z -1 ]. Be¬ 
sides, we will work with general infinite matrices not necessarily attached to operators 
on Hilbert spaces, a fact that requires a special care with matrix manipulations because 
the algebra of general infinite matrices suffers from certain pathological problems [13] . 
We will briefly describe some of these problems below with the aim of stating a simple 
setting where such pathologies can be well controlled by a few easy rules. This will 
be enough for our purposes, but the reader can find more comprehensive treatments of 
infinite matrices in classical references like P21 ; 2>j or the recent review [HH]. 

The first difficulty of dealing with infinite matrices is that matrix products can be 
ill defined, as it is the case of S t S where 


' 0 0 0 0 
10 0 0 
110 0 
1110 




( 1 ) 


To avoid this problem we will restrict matrix products to those whose coefficients can 
be computed with a finite number of algebraic operations on the matrix coefficients of 
the factors. 
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Definition 2.1. We will say that the product AB of two matrices A, B is admissible if 
any matrix entry (. ABA t jB hk involves only a finite (i, ^-dependent) number 
of non-null summands. 

The above definition applies not only to square matrices, but also to rectangular 
ones, so we can talk about admissible products between matrices and vectors, or between 
vectors. In what follows we will consider only admissible matrix products. 

The following special type of infinite matrices, which will arise in the Darboux 
transformations for CMV matrices, provide admissible matrix products. 

Definition 2.2. We will say that a matrix A is lower (upper) Hessenberg type if A hJ 
can be non-null only for j — i ^ N (i — j ^ N ) for some N e Z. 

Graphically, lower Hessenberg type matrices are characterized by the shape 

( **■■■* o o o o 
**■■■** 0 0 0 
**■■■*** 0 0 
**•••**** o 



while upper Hessenberg type matrices correspond to the transpose of this structure. 

Hessenberg type matrices are not necessarily square matrices. In particular, any 
column (row) vector is trivially a lower (upper) Hessenberg type matrix, and if it has 
finitely many non-null components then it is also an upper (lower) Hesssenberg type 
matrix. 

Any product AB with A lower Hessenberg type or B upper Hessenberg type is 
admissible. Instances of this are ST = I = TS and S t T = T — /, where / is the infinite 
identity matrix, S is given in (JT]) and 


T 


/ 1 

0 

0 

0 

0 -x 

1 - 1 

1 

0 

0 

0 \ 

0 

-1 

1 

0 

0 ... 

: o 

: o 

-1 

1 

1 


( 2 ) 


Indeed, the set of lower (upper) Hessenberg type matrices is closed under multiplication. 
Band matrices are precisely those which are simultaneously upper and lower Hessenberg 
type, thus they are closed under multiplication too, providing always admissible matrix 
products regardless whether they act as left or right factors. 

Some of the properties of the multiplication of finite matrices are also valid for admis¬ 
sible products of infinite matrices. For instance, the distributive property A(B + C) = 
AB + AC holds as long as the products AB and AC are admissible, and analogously 
for ( B + C)A = BA + CA. Besides, if AB is admissible, then B + A + is admissible too 
and {AB) + = B + A + , where A + denotes the adjoint (transpose conjugate) of A. 

However, even if all the involved matrix products are admissible, the associativity 
can fail, as it is shown by the example 

S t (TS) = S l I = S\ (S t T)S = (T — I)S = I — S. 


7 












The root of this problem is the non-associative character of oscillatory series which 
appear in the multiplication process. To see this clearly, let us fix our attention on the 
(0,0)-entry of the above products. Denoting X = (1,1,1,...), 


(S t (TS)) 0,0 = X(TX*) = £ Xj ( ^ T jtk X k ) = X 0 2 + £ X^-X^ + X, 

o \fc>0 / i 

((S t T)S)o,o = (XT)X t = 2 (S X i T i,k) X * = - X k+ i)X k = 0, 

fc^O \j>0 / 


= 1 , 


thus the different value of these entries is a consequence of the non-associativity of the 
oscillatory series 1 — 1 + 1 — 1 + ---. 

We can guarantee the associativity of matrix products requiring the absence of 
infinite series in the multiplication process. More precisely, assuming that the products 
(. AB)C and A(BC) are admissible, a sufficient condition for the associativity ( AB)C = 
A(BC) is the existence for any indices i, l of only a finite (i, /-dependent) number of 
non-null terms A l J B j k Ck i since in that case 



due to the presence of a finite number of non-null summands in all the sums. 

The presence of Hessenberg type matrix factors which provide admissible matrix 
products also guarantee their associativity according to the following simple rules. 

Proposition 2.3. The associative property ( AB)C = A(BC) of a matrix product is 
valid in any of the following cases: 

1. A and B are lower Hessenberg type. 

2. B and C are upper Hessenberg type. 

3. A is lower Hessenberg type and C upper Hessenberg type. 

Proof. Suppose for instance that A and B are lower Hessenberg type. Then, AB is ad¬ 
missible and lower Hessenberg type too, thus (AB)C is admissible. On the other hand, 
BC is admissible since B is lower Hessenberg type, thus A(BC) is admissible because 
A is also lower Hessenberg type. Besides, clue the lower Hessenberg type structure of A 
and B. 


A % 3 can be non-null only for j ~ i ^ N 
B j k can be non-null only for k — j ^ M 


AijBj :k C ky i can be non-null only for 
j ^ i + N and k ^ i + N + M. 


This ensures the associativity according to the sufficient condition given above. A 
similar argument proves the associativity in the two remaining cases. □ 



Another pathology of admissible products of infinite matrices is that, in contrast to 
the case of finite matrices, the existence of inverse can be consistent with a non-trivial 
left or right kernel. This is illustrated by the matrix T given in ((2]) which, despite having 
the matrix (jT]) as an inverse, satisfies 

(1,1,1,...)T = 0. 

In what follows we will use the following notation for the left and right kernel of a 
matrix A e C MxN , M, N e N u {oo}, 

ker L (A) = {X e C lxM : XA = 0}, ker fi (A) = {X e C Nxl : AX = 0}, 


where we understand that the vectors X are such that the products are admissible. 
So, in the case of the matrix T introduced in (J2]), ker L (T) = span{(l, 1,1,...)} and 
ker r (T) = {0}. 

The above fact also reveals possible uniqueness problems for the inverse of a general 
infinite matrix because such an inverse could be modified by adding a matrix with rows 
and columns lying on the left and right kernel respectively. An example of this is given 
by 

i 

-2 1 
1-2 1 


which satisfy JK = I = KJ for any ceC because ker L (J) = span{(l, 1,1,...)} and 
kerft( J) = span{(l, 1,1,... )*}. 

The previous example not only shows that a Hessenberg type matrix can have mul¬ 
tiple inverses, but also that these inverses can be non-Hessenberg type. Nevertheless, if 
a lower (upper) Hessenberg type matrix A has a lower (upper) Hessenberg type inverse 
H, no other inverse exists. For instance, if A and B are lower Hessenberg type, due to 
Proposition 12.31 1. any other inverse C should satisfy C = ( BA)C = B(AC) = B. In the 
upper Hessenberg case the associativity requirements force us to modify the uniqueness 
proof as C = C(AB) = ( CA)B = B according to Proposition 12.31 2. This result is the 
origin of the following definition. 

Definition 2.4. An infinite lower (upper) Hessenberg type matrix A with a lower 
(upper) Hessenberg type inverse will be called a lower (upper) H-matrix. Since the 
inverse is unique in this case, it will be denoted by A -1 . 

Examples of H-matrices are S and T given in (JTJ) and (121) , with S = T" 1 . Although 
H-matrices could seem quite scarce at first sight, we will see that they are precisely 
the kind of matrices arising in the CMV version of Darboux transformations. Actually, 
CMV matrices themselves are examples of matrices which are simultaneously lower and 
upper H-matrices. 

A triangular infinite matrix with non-zero diagonal entries is a special case of H- 
matrix because it always has a triangular inverse. The condition on the diagonal guar¬ 
antees for the leading submatrices the existence of triangular inverses obtained by en¬ 
larging the smaller ones. An induction process on the size of the leading submatrices 
generates the triangular inverse of the infinite triangular matrix. 



K = 



+ c 
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Iii general, we will denote by A -1 the inverse of a matrix A whenever this inverse is 
unique. Some of the rules for the manipulation of inverses of finite matrices also hold 
for infinite ones as long as they have a unique inverse. For instance, taking adjoints we 
see that AB = I = BA is equivalent to B + A + = I = A + B + , thus A and A + have a 
unique inverse simultaneously, and in this case (A + ) _1 = (Ar 1 )" 1- . In particular, this 
equality is valid for any H-matrix. 

H-matrices are essential in the Darboux transformation for CMV matrices since they 
allow us to perform standard algebraic manipulations which are forbidden for general 
infinite matrices. For instance, solving a simple matrix equation in the following way, 

AB = C => B= ( A~ l A)B = A~\AB) = A~ X C, (3) 

is possible if A is a lower H-matrix due to Proposition 12.31 1. but it is not possible for 
general infinite matrices. As an application of these ideas, we have the following result. 

Proposition 2.5. If A is a lower (upper) H-matrix, then ker#(A) = {0} (ker^(A) = 
{0}) and ker L (A)\{0} (ker R (A)\{0}) does not contain finitely non-null vectors, i.e. vec¬ 
tors with finitely many non-null entries. 

Proof. Suppose that A is a lower H-matrix. Using ([3]) we find that AX = 0 => X = 0. 
On the other hand, if A is a finitely non-null row vector, it can be considered as a 
lower Hessenberg type matrix, thus XA = 0 => X = X i(AA _1 ) = (XA)A~ l = 0 due to 
Proposition 12. 31 1. The result for upper H-matrices follows analogously from Proposition 
01 2 . □ 

The set of lower (upper) H-matrices is closed under multiplication and inversion, and 
contains the identity matrix. Hence, this set is a multiplicative group since associativity 
holds for lower (upper) Hessenberg type matrices due to Proposition 12. 31 1 (Proposition 
12.31 21. Actually, the previous results show that the set of infinite lower (upper) Hessen¬ 
berg type matrices with the sum and multiplication is a ring whose group of units is the 
set of lower (upper) H-matrices. Interesting subgroups of this multiplicative group are 
the sets of infinite lower (upper) triangular matrices with positive or non-null diagonal 
entries. 

Following the general idea of the Darboux transformation, and mimicking the Jacobi 
case according to previous comments, Darboux for CMV should deal with a factorization 
1(C) = AB for a Hermitian Laurent polynomial I(z) = az + fi + az^ 1 (a e C, fi e M), and 
a matrix C (eventually a CMV matrix) representing in some basis of A the multiplication 
operator 

z: A->A (4) 

f(z) >— zf(z ) 

Different matrix representations C of this multiplication operator are related by 
changes of basis and give rise to different factors A, B. Besides, we will see that the 
factors A, B themselves are closely related to certain changes of basis in A. Thus, 
changes of basis will play a central role in the factorizations related to the Darboux 
transformation for CMV matrices. 
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In the following section we will analyze the matrix representations of the multipli¬ 
cation operator (j3J) for bases of A with a structure similar to that one of a CMV basis, 
but assuming no orthogonality requirement. We will see that these representations are 
H-matrices, a key result to avoid the pathologies of general infinite matrices. 


2.1 Matrix representations of the multiplication operator 

A CMV matrix can be understood as the matrix representation of the multiplication op¬ 
erator (Jl]) with respect to a basis y = (xo> Xu X 2 , ■ ■ ■ Y °f orthonormal Laurent polynomi¬ 
als. This basis is obtained by applying to the canonical one 77 = (1, z, z -1 , z 2 , z~ 2 ,.. .)* 
the Gram-Schmidt orthonormalization with respect to a positive Borel measure sup¬ 
ported on an infinite subset of the unit circle (hereinafter ‘measure on the unit circle’). 
Thus, the orthonormal basis x satisfies 

j span {z ~ k ,..., z k } (coefficient of z~ k > 0 ), n = 2k, 

^ n n I span{z~ fc ,..., z k+l } (coefficient of z k+1 >0), n = 2k + 1. 


Due to the needs of Darboux for CMV pointed out previously, we will consider in this 
section bases x °f A satisfying simply (J5]), regardless of their orthogonality properties. 

Definition 2.6. A basis y of A satisfying (J5]) will be called a zig-zag basis. 


The nested structure of the subspaces L n implies that, for any zig-zag basis y, 


Z X0 e Iu\Lo, ZXn e 


L 


n+1 j 


even n, 


L n+ 2 \L n+ i, odd n. 


This can be rewritten in matrix form as 


Vt = Cy, 


C = 


+ 

* 

* 

* 

* 

* 

* 


+ 

* 

* 

* 

* 

* 


+ 

* 

* 

* 


+ 

* 


( 6 ) 


where the coefficients denoted by + are positive and the omitted ones are zero. The 
matrix representation C of the multiplication operator ((4j) in a zig-zag basis y is therefore 
lower Hessenberg type. 

The inverse z -1 of the multiplication operator is represented in a zig-zag basis y by 
a lower Hessenberg type matrix too. Indeed, 


z Vo e L 2 \Li, 


Z Xn *= 


lbn+1) 


even n, 
odd n, 
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or equivalently 


z X X = Cx, 



* + 

* * 

* * * + 

* * * * 

* * * * 

* * * * 



(7) 


On the other hand, from Proposition 12.31 1 we find that (CC — I)x = (z 1 z — l)x = 0, 
thus CC = I by the linear independence of y. A similar proof yields CC = I. Hence, C 
is a lower Hessenberg type inverse of C, which thus has a unique inverse C = C. 

In consequence, the matrix representation C of the multiplication operator in a zig¬ 
zag basis y is always an H-matrix. 

The simplest case corresponds to the canonical basis 77 , which leads to 


zr] = Sr /, 


5 


0001 

1000 

000001 

001000 


( 8 ) 


The matrix S is unitary, i.e. <S _1 = tS + . We will refer to S as the shift matrix since it 
represents the multiplication shift operator z n *-> z n+1 with respect to the basis rj. 

Any zig-zag basis y is characterized up to a positive factor by the corresponding 
matrix representation C of the multiplication operator. A way to understand this is to 
note that C determines C = C -1 , and the first and even equations of (J6]) combined with 
the odd equations of (J7J) give each Laurent polynomial y n as a linear combination of 
the previous ones y 0 ,..., Xn-i and 


zx 0 71 = 0, 

zXn-2 odd n, 

z~ 1 Xn -2 even n > 0. 


Therefore, C and y 0 determine y n for n ^ 1. 

The matrix representations of the multiplication operator in a zig-zag basis do not 
cover all the H-matrices with the shape given in (If)]) . A simple counterexample is 
obtained by slightly perturbing the shift matrix changing 0 by 1 in the (1,2) and (2,3) 
coefficients, 


H 


/° 1 m 

0 0 m 1 

1 0 0 CD 


0000 

0010 

0000 

0000 


0 1 
0 0 

0001 

1000 


\ 


? 


/ 


hr 1 


/ 0 -1 1 0 © 

100 
00001 
0 1 00-1 
0000001 
0001000 
,000000001 


\ 


/ 


Thus, H is an H-matrix with the shape (]6]h but its inverse does not fit with the structure 
(JTJ) due to the encircled coefficient 1 at site (0,4). 
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Changes of basis allow us to specify the set of matrix representations under study. 
By definition, zig-zag bases x are those related to the canonical one rj by 


X = T V, 


T e & 


set of lower triangular infinite matrices 
with positive diagonal entries. 


Therefore, Proposition 12.31 1 leads to (CT — TS)rj = C\ — zy = 0, so the linear in¬ 
dependence of rj implies that CT = TS. Then, using again Proposition 12.31 1 yields 
C = TST -1 , where parenthesis are omitted due to the associativity of the product. 

Hence, the matrix representations of the multiplication operator in a zig-zag basis 
constitute the equivalence class 


[5] = {TST’ 1 :Te^} 

of the shift matrix with respect to the relation “conjugation by the subgroup 2T'\ 
which is an equivalence relation in the multiplicative group of lower H-matrices due to 
the associativity properties of Proposition 12.31 

Another subgroup of H-matrices with special interest for us is the set of lower (upper) 
H-matrices A which are unitary, i.e. A -1 = A + . Every such a matrix A is necessarily 
banded because A = (A _1 ) + is simultaneously lower and upper Hessenberg type. This 
ensures that both A + A and AA + are admissible products for unitary H-matrices. How¬ 
ever, analyzing the unitarity of an H-matrix whose band structure is not guaranteed 
a priori requires avoiding to deal with the product A + A (AA + ) for a lower (upper) 
H-matrix A, since it could be non-admissible. Nevertheless, the following result shows 
that the unitarity of an H-matrix can be checked resorting only to admissible products. 

Proposition 2.7. A lower (upper) H-matrix A is unitary iff AA + = I (A + A = I). 

Proof. Suppose that A is a lower H-matrix, so that it has a unique inverse A -1 and 
this inverse is lower Hessenberg type. Then, A" 1 = A + implies that AA + = I. Con¬ 
versely, AA + = I leads to A -1 = A _1 (AA + ) = (A _1 A)A + = A + , where we have used 
Proposition 12.31 1. A similar proof runs for upper H-matrices. □ 

The unitary representatives of the class [<S] have a special meaning. 

Proposition 2.8. If C e [5] is unitary, then it is a band matrix with zig-zag shape 

* + 

* * * + 

+ * * * 

* * * + 

+ * * * 

* * * + 

+ * * * 

Proof. Bearing in mind the shapes of C and C = C^ 1 given in (JSJ) and (JTj). respectively, 
the result follows directly from the equality C + = C -1 . □ 


(9) 
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It was proved in [121 Theorem 3.2] that the unitary matrices with the zig-zag shape 
(191) are exactly the CMV matrices. Therefore, we get the following identification, which 
provides an alternative definition of CMV matrices. 

Corollary 2.9. C is a CMV matrix iff C e [«S] and is unitary. 

The previous result is an algebraic characterization of CMV matrices based on a 
previous characterization in terms of their shape. An explicit parametrization of CMV 
matrices is known, and we will introduce it later on. 

Zig-zag bases which are orthonormal with respect to a measure on the unit circle 
yield the CMV matrix representations of the multiplication operator. Bearing in mind 
the one-to-one correspondence between elements of [<S] and zig-zag bases of A (up to a 
positive factor), Corollary [279] implies that the unitarity of an element of [5] is equivalent 
to the orthonormality of the corresponding zig-zag basis with respect to a measure on 
the unit circle. 

The class [<S] constitutes the framework where we will develop the factorization 
leading to the Darboux transformation of CMV matrices. The study of this factorization 
is the objective of the following section. 

2.2 Change of basis and Darboux factorization 

Let us introduce the Darboux factorization for the class [<S]. For this purpose, consider 
a Hermitian Laurent polynomial £ e A\{0}, i.e. such that £* = £, where the substar 
operation in A is defined by 

C{z)-.= Ki/¥). (io) 

The general form of such an £ is 

d 

£{z) = ^ otjzf ctj e C, a-j = cij, ad V 0. 
j=-d 

Although most of the discussions along the paper are for the simplest non-trivial case, 
d = 1, bearing in mind future extensions to d > 1 (see Section 18.11) . we will not 
assume any restriction on d in the present section. For convenience, sometimes we 
will distinguish the ‘degree’ d of £ using the number N z = 2d of zeros of £ (counting 
multiplicity) rather than the value of d itself. 

Due to the ring structure of the set of infinite lower Hessenberg type matrices, £{C) 
is of this type for any C e [«S]. The alluded factorization will have the form £{C) = AB 
for some lower Hessenberg type matrices A, B. The hermiticity of £ ensures that £(C) 
is self-adjoint for a unitary C, i.e. when C is a CMV matrix. This situates the Darboux 
factorization of CMV matrices close to the standard setting of Darboux for self-adjoint 
operators. 

Let x be a zig-zag basis related to C e [«S]. We will see that any choice of a new 
zig-zag basis oj generates a factorization of £(C). First, note that x and u are related 
by a triangular change of basis with positive diagonal entries, i.e. 

X = Au, A e ST. (11) 
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A new matrix B comes from expressing Ecu in the basis x, 


Ecu = B X . (12) 

While A is a lower H-matrix because it lies on AT, for the moment we only can assure 
that B is lower Hessenberg type. This last statement follows from the relation 


Ecu n e ^(L n \L n _!) c= L A r 2+n \Ljv 2+n _ 1 , N z = number of zeros of E, 


which shows that B has the shape 


B 


N z 

I-1 


* * 
* * 
* * 


* * ® 

* * * (g) 

* * * * (g) 


J 


(13) 


where the symbol © stands for a non-null coefficient. 

The matrix B is never a lower H-matrix for a non-constant E because it has a non¬ 
trivial right kernel. This follows from (TT2]h which implies that B X (Q = 0 for any zero C 
of t. Actually, B may have no inverse at all. This occurs for instance if E{ 1) = 0, cu = rj 
and x = Tr/ with T given by ([2D, because then 0 = B X ( 1) = £>(1, 0, 0,... )* so that the 
first column of B vanishes. Nevertheless, we will see later on that in cases of interest B 
becomes an upper H-matrix with a band structure. 

The matrices A and B are the key ingredients of a factorization not only of E(C), 
but also of E(T>), where T> e [«S] is the representation of the multiplication operator in 
the basis cu. 


Proposition 2.10. LetC,T> be the matrix representations of the multiplication operator 
with respect to the zig-zag bases x, cu respectively, and let E be a Hermitian Laurent 
polynomial. Then, the matrices A, B defined by ([TTD and (TT2|) satisfy the following 
identities: 

1. E(C) = AB and E(D) = BA. 

2. CA = AT> and BC = VB. 

3. C = AVA- 1 and V = A^CA. 

Proof. From 0,0, the analogous relations for T> and T> 1 , (fTTTh (fT2l) and Proposition 
01, we obtain 

(E(C) - AB) X = lx- iAou = 0, 

(£(V) - BA)cu = Ecu — B X = 0, 

(CA — AV)cu = C X — zAcu = zx — zAcu = 0, 

(BC — VB) X = zB X — EVcu = zB X — zEcu = 0. 

Relations 1 and 2 follow from the linear independence of X an d zu. The remaining 
equalities are a consequence of the first identity in 2 and Proposition 12.31 1. □ 
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Parenthesis are omitted in Proposition 12.101 3 due to the associativity of products 
of lower H-matrices. We will use this standard convention for associative products 
in what follows, making explicit the parenthesis only when possible non-associative 
products appear. 

Proposition 12.101 1 is what we will call the Darboux factorization for the matrices 
in the class [<S]. It is worth remarking that this factorization is not defined for single 
elements of [5], but for every ordered pair of matrices in this class. Besides, there is 
a trivial dependence on the zig-zag basis chosen for each pair, since these bases are 
defined up to a positive factor. This degree of freedom amounts to the substitution 
A —> cA, B —* c~ 1 B , c > 0, in Proposition 12.101 In the next section we will study a 
particularization of these factorizations for CMV matrices. 

Proposition 12.10l 3 is a direct consequence of identifying the factor A as the change 
of basis between x an d u. We cannot formulate the analogue of this statement for the 
second identity of Proposition 12.101 2 due to the invertibility problems of B. 

The identities of Proposition 12.101 3 provide a way to obtain T> starting from C 
and the factorization i{C) = AB, as well as a way to obtain C starting from T> and 
the factorization UfD) = BA. Despite the apparent symmetry, there is a significant 
difference between these two identities when compared to the equality CA = AT> of 
Proposition 12.101 2 which originates them. As in (131) . the equation CA = Ay has a 
unique solution in the unknown matrix y because A is a lower H-matrix. However, 
the equation X A = AT> can have multiple solutions in the unknown matrix X because 
ker^(A) can be non-trivial. The general solution has the form X = C + Ka where 
C = ADA -1 and Ka is any matrix whose rows belong to ker^(A). Of course, due to 
Proposition 12. 31 1. we know that imposing a lower Hessenberg type structure on X leaves 
only the solution C. The conclusion is that no non-null lower Hessenberg type matrix 
Ka can be generated from keri,(A). This is in agreement with Proposition 12.51 

We will finish this section with an exact description of ker r(B) which will be of 
interest later on. 

Proposition 2.11. Let C be the matrix representation of the multiplication operator 
with respect to the zig-zag basis x an d let £ be a Hermitian Laurent polynomial with N z 
zeros counting multiplicity. If u ; is any other zig-zag basis and B is the matrix given by 
(HI; then ker R (i(C)) = ker r(B) and a basis of this subspace is given by 

U {,\-(C).x'(C),'".x |m ' 1 ) (C)}. 

f zero of t 
m multiplicity of ( 

Moreover, a matrix of JPb = {K = K + : BK = 0} vanishes iff its leading submatrix of 
order N z is null. 

Proof. First of all note that, due to the structure (TT51) of B , dim kei'rf B) cannot be 
greater than the number N z of zeros of I. 

Besides, using the fact that A is a lower H-matrix and B is lower Hessenberg type we 
find from Proposition 12.31 1 that ( AB)X = 0 <£» BX = 0. Bearing in mind Proposition 
12.101 1. this means that ker/j(T(C)) = ker r(B). 
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Taking derivatives in (fl2|) we conclude that X^HO e ker r(B) for any zero £ of £ and 
any j = 0,1,..., m — 1, smaller than its multiplicity m. Since the set 

U {x(0,X''(C),...,x ( "‘- 1) (C)l 

£ zero of i 
m multiplicity of £ 


has N z vectors lying on kein(B) and dimkerR(I?) < N z , to prove the first statement of 
the proposition it only remains to see that this set is linearly independent. 

For this purpose it is enough to show that det Ajv z 0 where, denoting by £i,..., £& 
the zeros of £, Ajv z is the leading submatrix of order N z of the matrix A of order cox At 
given by 


A - (A(Ci),A(Cs),...,A«n)), 


A(C) = (x(C),x'(C)....,x <m - 1) (C)). 

m = multiplicity of £ as a zero of i. 


If det Atv z = 0 then there exists a row vector X # 0 such that XAjv z = 0. This is 
also equivalent to state that the Laurent polynomial / = X(xo, Xi, • • ■, Xn z -iY satisfies 
/^(C) = 0 f° r any zero £ of £ and any j = 0,1,..., m — 1, smaller than its multiplicity 
m. In other words, / should have at least N z zeros, counting multiplicities. This is 
impossible because / is a linear combination of Xo, ■ ■ ■, Xn z - i, thus it has no more than 
N z — 1 zeros due to the zig-zag structure of the basis x- 

Suppose now that an infinite square matrix K satisfies BK = 0. This is equivalent 
to state that the columns of K belong to ker^(i?), i.e. I\ = AV for a matrix V of size 
N z x oo. Indicating by the subindex N z the leading submatrix of order N z , we have that 
K Nz = A Nz V N z . Since det A Nz ^ 0, the condition K Nz = 0 implies that Vn, = 0, so the 
first N~ columns of V and K are null. If, besides, K = K + , then the first N z rows of I\ 
vanish too, which can be expressed as A Nz V = 0. The condition det A Nz ^ 0 implies 
that V = 0, thus K = 0. □ 

We have described the Darboux factorization in the general framework of matrix 
representations of the multiplication operator with respect to zig-zag bases. In the next 
section we will explore some consequences of the previous results for the case of CMV 
matrix representations, i.e. when the zig-zag bases are orthonormal with respect to a 
measure on the unit circle. Our purpose is to show that suitable changes of basis connect 
standard factorizations (usual in the context of Darboux for self-adjoint operators) with 
well known transformations of measures on the unit circle. 


3 Darboux for CMV and Christoffel transforma¬ 
tions 

In this section we will analyze an especially interesting Darboux factorization for CMV 
matrices which is behind the CMV version of Darboux transformations. 
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The zig-zag bases leading to CMV matrices are precisely those which are orthonormal 
with respect to a measure on the unit circle. This establishes a one-to-one correspon¬ 
dence between CMV matrices and such measures, up to normalization. 

Apart from introducing Darboux transformations for CMV matrices, one of the 
purposes of the present section is to translate these transformations to the corresponding 
measures. However, in tackling the CMV version of Darboux transformations we will 
need to deal with zig-zag bases which are not necessarily orthogonal with respect to a 
measure on the unit circle. This forces us to work in the more general setting of linear 
functionals in A, a subset of which can be identified with the set of measures on the 
unit circle. Besides, this general setting will allow us to simplify the notation along 
the paper, providing at the same time a direct extension of Darboux transformations 
to quasi-CMV matrices related to quasi-definite functionals which are not necessarily 
associated with positive measures (see Section 18.2j) . Thus, we will first comment on the 
relation between measures on the unit circle and linear functionals in A. 

Any measure /i on the unit circle generates a linear functional u in A defined by 
u[f] = $ /d/i, so that no different measures give rise to the same functional [35]. For 
convenience, we will write u = dfj, to indicate the functional u generated by fi. Behind 
this notational convention lies the identification of a measure on the unit circle and 
the corresponding functional, much in the same way as in the case of functions and 
distributions. 

The inner product in A associated with the measure /i can be rewritten in terms 
of its functional u as ^ fg d/j, = «[/<?*], the substar operation in A being as in (fTOlh 
This provides a one-to-one correspondence between measures on the unit circle and 
positive definite Hcrmitian linear functionals in A, i.e. those linear functionals u: A —» C 
satisfying 


u = w* (Hermitian), u [ff*\ >0, / e A\{0} (positive definite), 

where the substar operation is defined for linear functionals in A by 

«*[/] = 4 /*]> / eA - 

Equivalently, u*|A n ] = u[z~ nl \ for every n e Z. 

We will extend the substar operation to Laurent polynomial matrices M(z) by 
M*(z) := M(lfz), defining also the new operation M + (z ) := M*(z)* = M(l/z) + . 
For convenience, in what follows M + will refer to M + (z), while the adjoint of M(z) will 
be explicitly denoted by M(z) + . Then, using the natural notation u[M] = (u[Mij]), 
the orthonormality of a zig-zag basis y with respect to a measure p, on the unit circle 
can be compactly expressed as w[xx + ] = I in terms of the functional u = d/r. This 
orthonormal basis y is completely determined by the positive definite functional u. 

If y is the zig-zag basis which is orthonormal with respect to a measure /i, the 
functional u = dji is determined by the conditions u[XnX o*] = <5n,o- Actually, these 
conditions determine a linear functional u in A for any zig-zag basis y (hereinafter 
‘the functional of the zig-zag basis’). Since the zig-zag basis y of a given matrix C e 
[«S] is determined up to a positive factor, this associates a unique functional u, up to 
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normalization, to any matrix C e [«S] . When C is a CMV matrix, u is the orthogonality 
functional of y, otherwise u can be even non-Hermitian. 

The Gram matrix of a general linear functional u in A with respect to an arbitrary 
basis l = (lo,h, ... )* of A is defined as u[ll + ], so that it coincides with the standard 
Gram matrix of an inner product when u is positive definite. The Hermitian functionals 
are precisely those with a Hermitian Gram matrix and, among them, the positive def ini te 
functionals are characterized by a positive def ini te Gram matrix, i.e. Xu[ll + ]X + > 0 
for every finitely non-null row vector X ^ 0 (equivalently, the leading principal minors 
of the Gram matrix are all positive). Therefore, a basis l of A is orthonormal with 
respect to a measure on the unit circle iff there exists a linear functional u such that 
u[ll + ] = I because the fact that this Gram matrix is trivially Hermitian and positive 
definite implies that u is Hermitian and positive definite. 

Any transformation of a measure on the unit circle can be understood as a trans¬ 
formation of the corresponding functional. For instance, if a Laurent polynomial £ is 
non-negative in the support of a positive measure p on the unit circle, then £d/j is again 
a positive measure with associated functional u£[f] := u[£f\. We use this identity to 
extend the meaning of u£ to any linear functional u in A and any £ e A, non of them 
necessarily Hermitian. Then, it easy to see that (u£)* = u*£*. 

Let us introduce now the specific factorization which will be the origin of the Dar- 
boux transformations for CMV matrices. 

Given a CMV matrix C, suppose that a Hermitian Laurent polynomial £ of degree 
one makes the self-adjoint matrix £{C) positive definite. This is equivalent to state that 
there exists a Cholesky factorization 


£(C) = AA + , 


which is known to be unique (see Appendix 1X1). 

Moreover, in view of the zig-zag shape (JHJ) of C, the matrix £{C) has the five-diagonal 
structure 


m 


* * © 

* * * © 
® * * * 
© * * 


© 

* © 


\ 

/ 


(14) 


with non-null entries in the upper and lower diagonals. This implies that A is not only 
lower triangular with positive main diagonal, but also 3-band with non-null entries in 
the lower subdiagonal, 


A 


+ 

* + 


(15) 




•v ./ 


The above Cholesky factorization is a particular case of the Darboux factorization 
analyzed in the previous section. To show this, consider a zig-zag basis y related to C 
and a new one co given by y = Aoj. Then, Proposition 12.101 1 shows that £{C) = AB, 
where B is the lower Hessenberg type matrix defined by £oj = B y. Since £{C) = AA + 
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and A is a lower H-matrix we conclude by Proposition 12.31 1 that B = A + . Hence, if V 
is the matrix representation of the multiplication operator in the basis oj, Proposition 
12.101 reads in this case as 


£(C) = AA + , 

£{V) = A + A, 

(16) 

CA = AV, 

C + A = AV + , 

(17) 

C = AVA~\ 

II 

i 

(18) 


Thus, the triangular matrix A coming from the Cholesky factorization of £(C) provides 
a ‘reversed’ Cholesky factorization of £(V). Nevertheless, as we pointed out after Propo¬ 
sition 12.101 the matrix V can be obtained directly via the second identity in (fl8|i . Note 
that the reversed factorization is admissible because A is a band matrix. The freedom 
of the zig-zag basis x cx i n a positive factor c changes u —» au, so it does not alter 
the matrix V neither the relations (fTBI) . (fl7j) . (TTST) . 

Summarizing, any Hermitian Laurent polynomial l of degree one defines a mapping 
C V between the set 


^ := {C CMV : £(C) is positive definite} 

and the class [<S]. This mappping satisfies (TT6l) . (ITT)) . (fT8l) with A as in (fT5]l . and, up 
to a positive rescaling, any zig-zag bases x, u associated with C, V, respectively, are 
related by 

X = Auj , £u = A + x- (19) 

The mapping C *-^> T> can be defined using the first relation of (fTTfl) and the second one 
of (fT8]i together with the condition A e 3F. 

t 

The following questions regarding the mapping C *->• V appear naturally: 

(Ql) When is V a CMV matrix? This is equivalent to ask about the unitarity of V, 
or, alternatively, about the orthonormality of uj. 

(Q2) Is C ^ D an isospectral mapping? If not, what changes may it induce in the 
spectrum? 

(Q3) What is the relation between two linear functionals u, v associated with the ma- 

t 

trices C. T>‘! In other words, what is the transformation u >-> v generated by the 
mapping C ^ VI 

(Q4) Relations (JED yield x.n e span{w n _ 2 ,a; n -i,ct; n } and £u> n e span{x„, Xn+i, Xn+2}- 

Does any of these conditions characterize the relation between C and V given by 

£ 

the mapping C *-» V‘! 

(Q5) It is known that CMV matrices are parametrized by sequences of complex numbers 
in the unit disk, the so-called Schur parameters. If V is a CMV matrix, how is 
the relation between the Schur parameters of C and V encoded in the factor A? 
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The aim of the rest of this section is to answer these questions. 

The answer to (Ql) follows easily from the properties of the mapping C V, using 
Proposition 12.31 to take care of the associativity. 

i 

Proposition 3.1. The mapping C >->• T> preserves the set i.e. it transforms every 
CMV matrix C with £{C) positive definite into a CMV matrix V with £(T>) positive 
definite. 

Proof. Since C is a CMV matrix, from the second relation in (TTTll and Proposition 12.31 1 
we obtain T> + = A~ X C + A. Combining this identity with the second relation in (jT8j) 
and bearing in mind that the product of lower Hessenberg type matrices is associative, 
yields VV + = A~ 1 CAA~ 1 C + A = I. Thus, Proposition 12.71 implies that V is unitary and 
hence CMV. 

From the second identity in (fTUjh and taking into account that any finitely non-null 
row vector X is a lower Hessenberg type matrix, using again Proposition 12.31 1 we find 
that X£(V)X + = XA+AX+ = ||AA'+|| 2 ^ 0, so X£(V)X+ = 0 iff AX+ = 0, which 
implies that X = 0 because A is an H-matrix. Therefore, £(T>) is positive definite. □ 

t 

We will refer to the mappings C >-> T> as the Darboux transformations for CMV 
matrices. Both, the definition of these mappings (which uses the Cholesky factorization 
of £{C )) as well as their domains depend on the choice of the Hermitian Laurent 
polynomial £. Nevertheless, a rescaling £ —* c£, c > 0, changes A —> \fcA , so it yields 
the same transformation C V. 

An important consequence of Proposition 13.11 is that it guarantees that Darboux 
transformations of CMV matrices can be iterated because ^ constitutes the class of 
CMV matrices C for which the Cholesky factorization of £(C) exists (see Appendix [All. 

As for (Q3) and (Q4), Proposition 13.11 states that the transformation u <—>■ v pre¬ 
serves the positive definiteness of the functionals, so when answering these questions 
we can suppose without loss of generality that u, v are positive definite and y, oj the 
corresponding orthonormal zig-zag bases. The answer to (Q3) and (Q4) is given by the 
following theorem, which is the main result of this section. 

Theorem 3.2. Let u, v be positive definite functionals in A with orthonormal zig-zag 
bases y, in and CMV matrices C, V respectively. Then, the following statements are 
equivalent: 

(i) C V for some Hermitian Laurent polynomial £ of degree one, i.e. C e ^ and 
T> = A~ 1 CA, where A e ST comes from the Cholesky factorization £{C) = AA + . 

(ii) v = u£ for some Hermitian Laurent polynomial £ of degree one. 

(in) x n e span{o; n _ 2 , u> n -i, <w} for n ^ 2 and y n f span{u; n } for some n. 

(iv) There exists a Hermitian Laurent polynomial £ f span{y!, y 2 } of degree one such 
that £u n e span{y n , Xn+u Xn+ 2 } for n ^ 0. 
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The Laurent polynomials £ mentioned in (i) and (ii) coincide up to a constant positive 
factor, and they also coincide with that one in (iv) up to a constant real factor. More¬ 
over, if £ coincides exactly in (i) and (ii) then x = Auj and £uj = A + x without any 
rescaling. 

Proof. The implications (i) =^> (Hi) and (z) =^> (iv) are already proved because we know 
that, up to a positive rescaling, (i) gives the relation (fT§D between \ and and A has 
the structure (1131) . 

(*) (m) 

Assuming (i), we can suppose without loss of generality that x = Au and £u = A + x 
by rescaling £ with a positive factor if necessary. Then, bearing in mind that any column 
(row) vector is a lower (upper) Hessenberg type matrix and using Proposition 12.31 we 
find that uh\ojL>j + ] = u[lo(£oj) + ] = A _ 1 u[yx + ]A = /. Due to the uniqueness of the 
orthonormality functional of u, we conclude that v = u£. 

Suppose now that v = u£. Then, (J6J) and Proposition 12.31 imply that u[xx + ] = 
it[£xx + ] = £(C)w[xx + ] = £(C). This means that, assuming u positive definite, v is 
positive definite too iff C s r ££(. Therefore, the hypothesis of the theorem guarantee the 
existence of the Cholesky factorization £(C) = AA + . Also, x = A*; because, using again 
Proposition 12.31 we get v[(A~ 1 x)(A~ 1 x) + ] = A~ l v[xx + ](^ + ) 1 = A~ 1 £(C)(A + )~ l = I. 
Then, the previous results prove that tui = A + x and V = A~ L CA. 

(Hi) => (ii) 

Let us look for a solution £(z) = az + /3 + -]z~ l of u£ = v, once (in) is assumed. 
First, for any Laurent polynomial of degree not bigger than one, (u£ — u)[yn] = 0 
when n ^ 3 because, from (ED and the five-diagonal structure ffl4l) of £(C), 

^Xn = (t(C)x)n e span{xn~ 2 , ■ ■ • , Xn+ 2 } => u£[Xn] = u[£Xn ] =0, U ^ 3, 

X.n e span{o; n _ 2 , w n _i,w n } => v[x n ] = 0, n ^ 3. 

Hence, u£ — v vanishes on the orthogonal complement of L 2 with respect to u. 

Second, there is a choice of £ such that u£ — v vanishes on L 2 . Using the canonical 
basis = ( 1 , z, z-y and expressing £ = rj^ + (l3, 7 , af , the condition (u£—v)[r]^ 2 ' > ] = 0 
becomes the linear system u[rj^rj^ + ](/3, 7, ct)* = v[rj^]. Since u is positive definite, 
det(u[r)^rj^ + ]) > 0 and this system has a unique solution (a,/?, 7). 

We conclude that v = u£ for a unique Laurent polynomial £ of degree not bigger 
than one. Since u and v are Hermitian, taking adjoints we find that v = u£*. Therefore, 
the uniqueness of £ implies that £ = £*, so that 7 = a. If a = 0 , then v is proportional 
to u, thus x is proportional to cu, which contradicts the hypothesis. Therefore, £ is a 
Hermitian polynomial of degree one. 

(iv) => (ii) 

The condition £u n e span{x„, Xn+i, Xn+ 2 } implies that u£[uj n ] = u[£u) n ] = 0 for 
n ^ 1 . Hence, (uc£ — u)[u> n ] = 0 for n ^ 1 and any c e C. Since £ e L 2 \span{xi, y 2 }, 
necessarily u\£\ A 0. Therefore, we can choose c e K. such that uc£ = v because 
(uc£ — u)[wo] = 0 iff c = v [l]/zz[£], which is real because u, v and £ are Hermitian. □ 
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This theorem states that the Darboux transformations of CMV matrices correspond 

i 

to the Laurent polynomial modifications of degree one dp I—* £d\x for measures on the 
unit circle. In the particular case £(z) = (z — £) (z _1 — C)> ICl < 1, these are known as 
the Christoffel transformations on the unit circle mm, which preserve the whole set 
of positive definite functionals (i.e. is the whole set of CMV matrices in this case) 
because such an £ is non-negative on the unit circle. Nevertheless, a general Laurent 
polynomial modification dp <—>■ £dp preserves positive definiteness only when £ is non¬ 
negative on the support of p. The proof of (i) <=> (ii) in Theorem 13.21 uncovers a matrix 
characterization of such measures, which we enunciate separately. 

Corollary 3.3. Given a Hermitian Laurent polynomial £ and a positive definite func¬ 
tional u with CMV matrix C, 

u£ is positive definite o £(C) is positive definite. 

Although the proof of this result given in Theorem 13.21 was enunciated for the case 
of a degree one £, the proof for an arbitrary degree remains unchanged. This char¬ 
acterization is important when an unknown measure is determined only by its Schur 
parameters, which provide directly the corresponding CMV matrix (see Oil) ). 

The equivalence with Laurent polynomial modifications of measures gives informa¬ 
tion about the spectral behaviour of Darboux transformations. Let C be a CMV matrix 
with associated measure p and consider the Hilbert space L 2 of p-square-integrable 
functions. Then, C can be understood as the representation of the unitary multiplica¬ 
tion operator 

IV Ll-^Ll (20) 

f{z) zf(z) 

with respect to the orthonormal basis of L 2 given by a zig-zag basis y of C- Therefore, 
the spectrum of C coincides with that of V which is the support of p. The mass 
points are the corresponding eigenvalues, which are simple because their eigenvectors 
are spanned by the characteristic functions of the mass points mm- On the other 

hand, every transformation p ■-» v preserves the support of p up to the mass points 
located at the zeros of £, which do not appear in v. Therefore, we have the following 
spectral consequence of the previous theorem, which answers (Q2). 

Corollary 3.4. Given a Hermitian Laurent polynomial £ of degree one, the mapping 
C 1 —* H preserves the spectrum for every CMV matrix C e V with the exception of 
at most two points: The spectrum of V is obtained from that of C by excluding the 
eigenvalues which are zeros of £. 

In particular, C •—> T> is a isospectral transformation whenever £ has its zeros outside 
of the unit circle. Otherwise it is almost isospectral, in the sense that it preserves the 
spectrum up to finitely many points. More precisely, if £ has its zeros on the unit circle 
the only spectral changes that the transformation may produce is the elimination of one 
or two eigenvalues depending whether £ has one or two different zeros. 
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Remember that the factor B of general Darboux factorizations was only lower ffes- 
senberg type and not necessarily invertible, ffowever, in the case of the Darboux trans¬ 
formations for CMV, the factor B = A + is not only invertible but also 3-band and 
upper triangular, so it is an upper H-matrix. Therefore, in this case we can complete 
the relations (fT6]h (ITT)) . (fT8|) with the additional ones 

C = B^VB, V = BCB- 1 . 


Moreover, the relation between the factors A and B = A + gives information about the 
left kernel of A (the right one is trivial): ker^A) = ker^(R) + , where kerij(R) is given 
by Proposition 12.Ill 

Question (Q5) refers to the explicit parametrization of CMV matrices given by 
PH GUI E] (our notation is related to that of IZD3 by a n = —ct: n _ i, while we take the 
transposed of the CMV matrix primarily used in [70]), 


( — CLo a l 

®oPi 





— plO -2 

— CL1CL2 

— P2«3 

P 2 P 3 



PlP 2 

a lP 2 


02^3 





~P3«4 

— CZ3O4 

—^405 

P4P5 



P3d4 

<DP4 

— CL^d^ 

a 4 P5 


V . 


where a n e C are the so-called Schur parameters (or Verblunsky coefficients), which 
satisfy |a n | < 1 for n 5= 1, and p n = a/1 — |a n | 2 . Schur parameters establish a one- 
to-one correspondence between sequences in the open unit disk and CMV matrices, 
thus they parametrize also the measures on the unit circle up to normalization. The 
Schur parameters also determine the orthonormal polynomials ip n with respect to the 
corresponding functional u = dp via the forward and backward recurrence relations 


Pn<Pn{z) = Z(p n -i(z) + an^-i^), 
p n z<p n -i(z) = ip n (z) - a n <p*(z), 


¥>«(*) = ^Vn*(vb 


where <p* is known as the reversed polynomial of ip n . As a consequence, 


(Pn(z') T ' ' ' T Q-n)) 


1 

^n— 1 Priori’) ^0 / r 

V u[i\ 


( 22 ) 


(23) 


The orthonormal polynomials (ONP) are connected to the orthonormal Laurent poly¬ 
nomials (ONLP) by the relations 


X 2 n(z) = z n tp% n (z), X2n+l(z) = Z n ip 2n+1 (z), (24) 

which, combined with ([22]) . lead to 


p2nX.2n ~ X2n-1* + «2n,\'2n-l, p2nX2n-\ ~ X2n* ~ «2nX2n- (25) 
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Before answering (Q5) let us fix a notation concerning the mapping C >—> T), which 
will be used in the rest of the paper. 


CMV 

Functional 

ONLP 

ONP 

Schur 


c 

u = d/i 

Xn 

Pn 

On 

Pn 'sj 1 | O n | ^ 

^n 1 = PW ■ -P1^U[1] 

V 

c5 

III 

a, 

Mn 

Ipn 

bn 

On = V 1 - \ b n 2 

A ~ l = O n - ■■ 0!^V[1] 


The coefficients of the factor A will be denoted as 


A 


fr 0 
s 0 n 
to £1 r 2 
t\ s 2 r 3 

f 2 S 3 r 4 

t 3 s 4 r 5 


\ 


J 


V 




>0, s n e C, t n e C\{0}. 


(26) 


(27) 


From the explicit parametrization (l 2 Tj) of C we see that, not only the upper and lower 
diagonals of the five-diagonal matrix £{C) are non-null, but its (1,0) and ( 0 , 1 ) matrix 
coefficients cannot vanish either, so that the structure (fl4l) becomes 


£(C) 



© © 

* * © 

* * * © 

© * * * © 

© * * * © 


\ 

/ 


(28) 


This implies that not only t n ^ 0 for all n ^ 0, but also s 0 A 0. 

Using (l24|) . the relations x = Aoj and £oj = A + x read as (taking the reverse of the 
equations for even n) 


Vn(z) = t n _ 2 z^ n - 2 {z) + s n -iV n -i{ z ) + r n i/j n (z), 
z£(z)ipn(z) = r n z(p n (z) + s n ip* n+1 {z) + t n ip n+2 (z), 


If £(z) = az + /3 + az 1 , identifying in the above equalities the terms of highest and 
lowest degree and using (1231) leads to 


ftn , ^n Pn+l 

T n T 5 t n Qpn +2 : 


A„ 


ftn+2 


t ^ ^n— 1 ^ Pn 

= O n S n — 1 = CL n S n —1 j 

ftn T'n—l 


n ^ 0, ao = bo = 1. (29) 


—i ^n+ 1 fn 

Cyb n aCL n +2 X S n - OUl n j r2 S n , 
An Pn+l 
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The equation of the second line is the translation to the Schur parameters of the Darboux 
transformations for CMV matrices. The last equation is an additional constraint that 
appears when fixing the Laurent polynomial l. It is worth noting that the identities of 
the first line lead to the simple relations 


Di—1 Pn tn —1 &n 

Y'n @n tn Pn +2 

Besides, combining the two last equations in (|29|) we get 


bn. U., 


X 2 

A n -1 


(CKCZ n -|_ i 


(30) 


(31) 


a nonlinear recurrence relation connecting directly the Schur parameters a n and b n . 
Actually, given a n , (l3Tf generates inductively b n starting from b 0 = 1. 

The factorization £(C) = AA + , explicitly written in terms of (l27|h reads as 


\tn— 2 I T |s n _i|^ + r n f3 2Re(aa n a n+ i), 

Sn— ltn— 1 T T n S n Pn+l(^^n Q^n+ 2 )) TL 0, (32) 

^n,tn &Pn+lPn+2i 


which, starting from the initial conditions t_ 2 = t -1 = s_i = 0, determines inductively 
the coefficients r n , s n , t n for n ^ 0. This shows that, if a solution of (1TZ1) exists, it is 
unique. The existence of such a solution requires the positivity of /3 — 2Re(aa n a n+ i ) — 
|s n _i| 2 — |tn— 2 1^ at every induction step, and it is equivalent to the positive definiteness 
of 1(C) (see Appendix 1X1) . 


4 Inverse Darboux for CMV and Geronimus trans¬ 
formations 

In this section we will study the ‘inverse’ of the Darboux transformations for CMV 
matrices introduced in the previous section. More precisely, given a Hermitian Laurent 
polynomial £ of degree one and a CMV matrix T>. we will search for the CMV matrices 
C which are transformed into T> by the mapping C T>. In view of Theorem 13.21 this 
amounts to characterizing the CMV matrices C of the positive measures p satisfying 
£dp = dv, where v is a measure associated with T>. Corollary 13.41 implies that the inverse 
Darboux transformations are almost isospectral, since they preserve the spectrum up 
to the addition of at most two eigenvalues. 

In contrast to Darboux transformations, the inverse Darboux transformations can 
have no solution or multiple solutions. For instance, if £(z) = (z — C)(^ _1 — C)> |C| = 1) 
no measure p on the unit circle solves the equation £dp = dfl, where dd(e l6 ) = d6/2ir 
is the Lebesgue measure. On the other hand, under the same choice of £, if Idp = du, 
then £(dp + m§^) = du for any positive mass m, where 5^ is the Dirac delta at (. 
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The parametric structure of the set of solutions for the inverse Darboux transforma¬ 
tions depends on the localization of the zeros £ 1 , (2 of £ with respect to the unit circle. 
Dne to the hermiticity of £, the transformation £ — 3 ► £ leaves invariant the set of zeros 
of L Hence, given a measure v on the unit circle and a Hermitian Laurent polynomial 
£, we have the following possibilities for the solutions /i of Idfi = du: 

• Zeros outside of the unit circle: £1 = £ = £ 2 \ |£| < 1. 

Then, £(z) = c|^ — £| 2 , c e ®1\{0}, for \z\ = 1, so £d[i = du is solved by a unique 
positive measure d/i = du/£ or by no one depending whether c > 0 or c < 0. 

• Zeros on the unit circle: £1 = e* 01 , £2 = e* 02 . 

Then, £(z) = c sin(^dA) sin(^y^), ce M\{0}, for z = e ld . Thus, depending on the 
integrability and positivity of dv/£, either no positive measure solves £dji = du, 
or there are infinitely many solutions 

dv 

d^i = — + m d c , m ^ 0, if £1 = £ = £ 2 , 

du 

du = — + + m 2 ^ 2 , m 2 > 0, if £1 ¥= £ 2 , 

parametrized by one or two real parameters, namely, the masses of fi at £ 1 , £ 2 . 

The first of these cases is closely related to the Geronimus transformations on the 
unit circle [25[ [27], defined by 

d/j, = — + m5^ + rri5--i , £(z) = (z — £)(^ _1 — £), |£| <1, me C. 

Starting from a positive measure v on the unit circle, the Geronimus transformations 
yield in general a complex measure fi which generates a Hermitian linear functional 
u[f\ = $ /dyU in A. Since positive definite functionals are represented by measures 
supported on the unit circle, this functional is positive definite iff m = 0, which is the 
case covered by the inverse Darboux transformations. Geronimus transformations can 
provide quasi-dehnite functionals « for m y 0, but the study of these cases in the setting 
of Darboux transformations requires their generalization to quasi-CMV matrices, i.e. 
to the matrices playing the role of CMV in the case of quasi-dehnite functionals on the 
unit circle (see Section I8i2]h 

Summarizing, the inverse Darboux transformation of a CMV matrix T> leads to a 
set 

%{V) ■= {C CMV 

which can be eventually empty, and whose elements can be naturally labelled by at 
most two real parameters. These elements C must satisfy the identities f[T6j) . (TT7T) . f[T8lh 
and their orthonormal zig-zag bases x must be related to an orthonormal zig-zag basis 
co of V by (TH?D . 
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According to the results of the previous section, a matrix procedure to obtain the 
inverse Darboux transforms c &.\(V ) of V starts by performing a reversed Cholesky fac¬ 
torization £{V) = A + A, A e which implies that A must have the 3-band structure 
f|15p . It is possible to prove that, similarly to the standard Cholesky factorization, the re¬ 
versed one is possible iff I? e ^ while, in contrast to the case of finite matrices, reversed 
Cholesky factorizations of infinite matrices are not unique in general (see Appendix [A]) . 
This is in agreement with the fact that V) can have more than one element. For 
each of these reversed factorizations we can obtain a matrix C = AVA e [«S], and 
any solution of the inverse Darboux transformation should be obtained in this way. 
Since we expect in general a parametric solution for this matrix procedure, we will refer 
to it as the Darboux transformation with parameters corresponding to the Hcrmitian 
polynomial £. 

Let us analyze in detail the Darboux transformation with parameters. Suppose T> e 

with a zig-zag basis c o, and consider a reversed Cholesky factorization £(T>) = A + A, 
A s A?. Defining the zig-zag basis y = Aw and the matrix B by £u = B\ leads to 
Bx = £oj = £{V)uj = A + Aw = A + x, so that (jT9|) holds and B = A + due to the linear 
independence of y. Then, we know by Proposition 12.101 that (TT6l) . (1171) and (IT8j) are 
valid for the matrix C e [<S] related to y. Hence, C e foe(V) as long as it is a CMV 
matrix. However, up to now there is no reason to assume that C is CMV. Bearing in 
mind Corollary 12.91 to prove this we only need to show that C is unitary, which in view 
of Proposition 12.71 is equivalent to the identity CC + = I. 

Trying for C a proof similar to that of the unitarity of V for the direct Darboux 
transformations would start from the identities C + A = AT> + and C = AVA -1 in (TT71) 
and (USD- However, the temptation to use the first of these identities to write C + = 
AV + A~ l fails because, using Proposition 12.31 we can only deduce that 

(C + A)A~ l = (AV + )A~ 1 = AVA~\ (33) 

since A and V + are band matrices. The associativity of the first term is not guaranteed 
by Proposition [273] because A is band but A -1 is lower Hessenberg type and C + is upper 
Hessenberg type (we do not know yet if C is a CMV!). 

An attempt to overcome this problem resorts to the direct use of the two identities 
in dm) , together with Proposition 12.31 as follows 

(CC + )A = C(C + A ) = C(AV + ) = (CA)V + = (AV)V + = A( VV + ) = A. (34) 

However, the same associativity problem found in (j33j) reappears now when trying to 
prove that CC + = I using (1341) . The only conclusion from (1M|) is that the rows of CC + — I 
belong to ker^(A) = ker/j(A + ) + , which is known to be non trivial by Proposition 12.111 
Since CC + — I is Hermitian, this is equivalent to state that their columns lie on ker#(A + ), 
or using the notation of Proposition 12.111 

CC + = / + K, I< e Xa+ = {I< = K + : A+K = 0}. (35) 

There is reason for the failure of the previous attempts to prove the unitarity of C: 
not all the reversed Cholesky factorizations £(T>) = A + A, A e lead to a unitary 












C = ADA~ 1 . In other words, the Darboux transformation with parameters not only 
yields the set ApfD) of inverse Darboux transforms of D , but it also gives spurious 
solutions C e [5] which are not CMV. 

Take for instance u = d as the Lebesgue measure and ((z) = (z — C)(- _1 — C); 
|C| = 1- Then V = S and there is no solution /r of (dp, = du, i.e. ^(S) = 0. However, 
since (dd is a positive measure on the unit circle, we know by Corollary 13.31 that 
and, hence, there exist reversed Cholesky factorizations ((S) = A + A, A e A, leading 
to spurious solutions C = ADA -1 . Actually, the functionals of these spurious solutions 
can be even non-Hermitian. This will be illustrated with an explicit example later on. 

The appearance of spurious solutions can be made apparent by rewriting explicitly 
the reversed factorization ((D) = A + A in terms of (l27|) and ((z ) = az + (3 + az _1 , 

^ r l + l 5 n | 2 + \t n \ 2 = P ~ 2Re(a6 n 6 n+ i), 

^ SnT'n+l “1“ tnSn+1 = ^”n+l(^^n ^^ 71 + 2)5 77- ^ 0. (36) 

f»An+2 Q ,< Ai+1 < Ai+2 ; 

In contrast to (l32]h these relations do not determine the coefficients r n , s n , t n . Actually, 
expressing (1361) as 


r n +2 = a 


1 ^"n+l^n+2 


Tn, 


CM, (J n j t \{otb n (M.b n -^ 2 ) Sn^n +1 

^n+1 = | | 1 


tn 


a 


rTn 


\a\ 


Tn. 


\a\ 


(37) 


t„ - V (3 - 2 Re(af)„f)„+i) - r 


shows that r n , s n , t n are determined inductively for n ^ 0 by the three parameters r 0 , 
So, r i, which enclose the freedom in the reversed factorization of ((D). Nevertheless, 
this does not mean that any choice of ro, So, rq should give such a reversed factorization, 
a fact that depends on the positivity of (3 — 2Re(ab n b n+ i) — r 2 — |s n | 2 at every induction 
step. The existence of a solution of (IHlpj) for at least one choice of the “free” parameters 
r 0 , so, ri is equivalent to state that ((D) is positive-definite (see Appendix [All . 

Therefore, the general solution of a Darboux transformation with parameters de¬ 
pends on two real (?’o, rf) and one complex (so) initial conditions, i.e. four real param¬ 
eters. This is in striking contrast with the corresponding inverse Darboux transforms, 
which depend on at most two real parameters. Of course, this counting is indicative 
of the amount of spurious solutions only up to initial conditions r 0 , s 0 , rq giving no 
reversed factorization ((D) = A + A. 

Before presenting an explicit example of a spurious solution, we will give a char¬ 
acterization which allows us to distinguish the CMV solutions from the spurious ones 
with a few calculations. Such a characterization is based on Proposition 12.111 and some 
results concerning the hermiticity of the solutions u of u( = v. 

Lemma 4.1. Let ( be a Hermitian Laurent polynomial of degree one and v a Hermitian 
linear functional in A with u[l] A 0. Then, a linear functional solution u of u( = v is 
Hermitian iff u[l^] = u[k] for a basis Iq, 1\ of Li. 
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Proof. The hermiticity of u on a basis of Li, necessary for its hermiticity on A, is 
equivalent to its hermiticity on L^. On the other hand, the relation u£ = v, together 
with the hermiticity of v and £, leads to 

u[(£f)*\ = u[if*\ = u[/*] = v[f] = u[£f], f e A, 

which means that u is Hermitian in the subspace £A of A. This, combined with the 
hermiticity of u in L 1; implies its hermiticity on A = Lj © £A. □ 

The previous lemma has the following consequences concerning the hermiticity of 
the functionals associated with the solutions of the Darboux transformations with pa¬ 
rameters. 

Proposition 4.2. Let £ be a Hermitian Laurent polynomial of degree one, with 

Schur parameters b n , a n = -y/l — \b n \ 2 and v a related functional. Consider an arbitrary 
solution C e [5] of the corresponding Darboux transformation with parameters, i.e. 
C = AVA 1 with £(T>) = A + A, A e L7. If u is the functional of a zig-zag basis x related 
to C, then u£ = v up to a positive rescaling of £. Besides, u is Hermitian iff any of the 
following equivalent conditions is satisfied: 

(i) u[z~ L ] = u[z], 

(ii) u[xi*] = 0. 

(Hi) Xi* e span{xi,X 2 }- 

(iv) The matrix A has the form (127| with 

Tq = (5 — 2Re(cta), a = b\ + — G\. (38) 

n 

Proof. Assume the hypothesis of the statement and let oj be the zig-zag basis which is 
orthonormal with respect to v. Using Proposition 12.31 we find that CAuj = AT>A~ 1 Auj = 
AVoo = zAoj, which means that Au is a zig-zag basis of C, so a positive rescaling of £ 
yields x = Alu and £u = A + x■ Since A has the structure 027 J, from the latter relation 
we conclude that £oo n e span{x„, Xn+ i, Xn+ 2 } and the functional u of x satisfies u\£uq\ = 
r o u [Xo\ > 0 because Xo is a positive constant and, by definition, u[XnXo*] = 8 n , o- As a 
consequence, «[£]>() and (ucl — u)[u; n ] = cu[Ico n ] — w[o; n ] = 0 for n ^ 1 and any ceC. 
Since ( uc£ — n)[o;o] = 0 iff c = u[l]/«[/:], we get uc£ = v for such a positive value of c. 

The characterizations of the hermiticity of u given in (z) and (ii) follow from the 
previous result, Lemma 14.11 arid the fact that u[x 1 ] = 0 and u\x 0 ] = n[Xo*] > 0 for the 
functional u of y. 

Since u[,\ n ] is non-zero only for n = 0, the relation Xi* e spanll,^^ 1 } c L 2 = 
span{y 0 , Xi) X 2 } shows that zz[yi*] = 0 iff xi* e span{yi,y 2 }- This proves the equiva¬ 
lence (ii) (in). 
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As for the equivalence (in) o (■ iv ), note that xi* E span{xi? X 2 } iff {xi, X 2 , yi*} is 
linearly dependent because {yi,y 2 } is linearly independent. From (1251) we know that 
cr 2 u ; 2 = <^ 1 * + 62 ^ 1 , which, combined with y = An, provides the expansion 


Xi* = s 0 uJo + nuu = s 0 ^o - rib 2 ui + ri<j 2 U) 2 . 


Since Xi = ScM) + r ikh and y 2 = t 0 u} 0 + siuji + r 2 uj 2 , the linear dependence of {xi,X 2 , Xi*} 
reads as 


s 0 

r l 

0 


to 

Si 

T2 

= 0 

So 

—r\b 2 

T\(72 



Introducing the expressions of to, s 1 , r 2 in terms of r 0 , So, r\ given in ()37lh this deter- 
minantal condition becomes (iv) after direct algebraic manipulations. □ 

We are now ready to prove the characterization of the CMV solutions for the Dar- 
boux transformations with parameters. 


Theorem 4.3. Let t be a Hermitian Laurent polynomial of degree one, T> e ^ with 
Schur parameters b n , a n = y/l — \b n \ 2 and v a related functional. Consider a reversed 
Cholesky factorization £(T>) = A + A, A e , the corresponding solution C = AVA^ 1 of 
the Darboux transformation with parameters and a related zig-zag basis y. Then, C is 
CMV iff any of the following equivalent conditions is satisfied: 


(i) The first two rows of C constitute an orthonormal system, i.e. the leading subma¬ 
trix of CC + of order 2 is the identity. 

(ii) There exists a linear functional u which solves ui = v, up to a positive rescaling 
of l, and such that u[y^y^ + ] is the identity, where = (xo,Xi Y- 

(in) The functional u of y satisfies n[yi*] = 0 and u\xiXi*\ = 1- 

(iv) The basis y satisfies xi* E span{yi,y 2 } and 

(Xi/Xo)(z) = P~ 1 (z +a), |a| < 1, P = V 1 - M 2 - 

(v) If £(z) = olz + (3 + az -1 , the matrix A has the form (l27l) with 


r^ = (3 — 2Re(aa), 


, . s o cn 

a = h + —ai, r 1 = r 0 —, 

r 1 p 


|a| < 1, 

p = yi - |a| 2 . 


(39) 


Moreover, the parameter a in (iv) and (v) is the first Schur parameter a\ of C. 

Proof. From Proposition 12.71 and Corollary 12.91 the identity CC + = I is equivalent to 
the unitarity of C and thus to stating that it is CMV. Therefore, the equivalence with 
(i) follows directly from (135|) and the second statement of Proposition 12.111 bearing in 
mind that B = A + and the number of zeros of £ is N z = 2 in the present case. 
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Also, C is CMV iff u[yx + ] = / for a linear functional u in A, and in this case 
Theorem 13.21 states that u£ = v up to a positive rescaling of £, thus u satisfies ( ii ). To 
prove the converse, consider the zig-zag basis u which is orthonormal with respect to v. 
As in the proof of Proposition 14.21 y = Au and £u = A + y up to a positive rescaling of 
£. Suppose that u is an arbitrary linear functional solution of u£ = v. Using Proposition 
12.31 we get A + = u£[uu + ]A + = u[£u(Au) + ] = A + u[xx + ], so that 


u[ X X + ] = I + K, A + I< = 0. (40) 

If u[xd)^( 1 )+] i s the identity, then u[y 0 ] = «[xo*] > 0 and w[y i] = 0 = ii[yi*]. This 
guarantees the hcrmiticity of u due to Lemma 14.11 As a consequence, K = K + , i.e. K e 
J^ 4 +. Then, (140|) and the second statement of Proposition 12. 1 II ensure that 'u[yy + ] = / 
whenever the leading submatrix of w[xx + ] of order 2 , which is w[y( 1 )y( 1 ) + ], is the 
identity. 

Concerning the equivalence with (in), note that a linear functional u satisfying 
u[xx + ] = I is necessarily the functional of y, which therefore must satisfy ufyi*] = 0 
and u[xiXi*] = 1 when C is unitary. Conversely, if the functional u of % satisfies 
these conditions, then rt|x^X^ + ] j s the identity because by definition u[xnX o*] = 5 n ,o- 
Proposition 14. 2 1 ensures that the functional u of y solves u£ = v up to a positive rescaling 
of £, thus we conclude that u satisfies (ii) and hence C is CMV. 

As for the equivalence with (iv), let us write explicitly (xi/Xo)( z ) = n(z + a), k > 0, 
a e C. If C is CMV then, from C\ = %X an d fl2T]l . we get a = a± and k = p£ l . Besides, 
(125|) gives P 2 X 2 = Xi* + U 2 X 15 so Xi* E span{xi, yg}- For the converse, note that the 
functional u of y satisfies u[yi] = u[ y 2 ] = 0, thus u[xi*] = 0 under the condition 
Xu e sp a n{xi,x 2 }. This condition also implies that w[xiXi*] = k 2 (1 — |a| 2 ) because 
X 1 X 1 * = X 0 X 0 * ^ 2 (1 + \a\ 2 + az + az _1 ) = y 0 y 0 *« 2 (l - |«| 2 ) + KaxiXo* + naxoXu a nd 
u [XnX 0 *] = Bearing in mind the equivalence with (Hi), this proves that C is CMV 
under the hypothesis given in (iv). 

Finally, let us prove the equivalence with (n). If uj is the orthonormal basis with 
respect to v we know that y = Au is a zig-zag basis of C and £uj = A + y. If C is CMV, 
using Proposition l2.3l we get u£[ujoj + ] = u[ui(£u>) + ] = A _1 u[yy + ]A = /, so v = u£. From 
Cy = zy and (JUD we find that yq = XoPi 1 (z + ai), thus £xoXo* = «PiXiXo* + ayoXi* + 
(/3-2Re(aa 1 ))xoXo*- Since y 0 = r 0 u 0 , we get = v[y 0 y 0 *] = u[£xoXo *] = /3-2Re(aa 1 ), 
which proves the first identity of (v) for a = a\. The rest of the identities follow from 
f|29p . Let us prove now the converse. The Schur parametrization (OTj) of T> combined 
with Vu = zu yields — biUQ + = zu 0 . Therefore, assuming (v), y = Au yields 


Xo = r 0 uJ 0 , 


Xi = so + r 1 W 1 


— (au o - biu 0 + aiui) = y 0 
P 


z + a 
P 


J 


which gives the second condition in (iv). To prove that C is CMV we only need to show 
additionally that yi* e span{y 1 ,y 2 }. This is equivalent to (l38]h which coincides with 
part of the conditions given in (n). □ 


Proposition 14. 2\ (iv') and Theorem 14.31 (v) provide characterizations of those reversed 
factorizations £(T>) = A + A leading respectively to Hermitian or positive definite func¬ 
tionals, via the Darboux transformation with parameters. Both characterizations are in 
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the same spirit, i.e. they are given as restrictions on the initial parameters ro, So, n de¬ 
termining the matrix factor A. The (hermiticity) positive definiteness restrictions have 
the effect of reducing from four to (three) two the number of “free” real parameters. 
Of course, not any choice of the parameters r 0 , s 0 , r t satisfying such restrictions leads 
necessarily to a reversed factorization £(T>) = A + A, so the only conclusion is that the 
solutions of the Darboux transformation with parameters leading to (Hermitian) posi¬ 
tive def ini te functionals are parametrized by at most (three) two real parameters. This 
is in agreement with our previous discussion based on the interpretation of inverse Dar¬ 
boux transformations in terms of measures on the unit circle. We will refer to (|38l) and 
(l39j) as the hermiticity and the CMV conditions respectively. Note that the hermiticity 
condition ([38]) coincides with the first part of the CMV conditions (l39j) . Therefore, the 
spurious solutions leading to Hermitian functionals are characterized by satisfying (138|) . 
but not the rest of the CMV conditions (|5U1) . i.e. either |a| ^ 1 or r\p ^ r 0 <7i. 

When the CMV conditions are taken into account, the Darboux transformations 
with parameters can be iterated because the set of CMV solutions is a subset of 

the class of CMV matrices C which allow for a reversed Cholesky factorization of 
£(C) (see Appendix®. 

We will finish this section illustrating in a detailed example the coexistence of CMV 
and spurious solutions of the Darboux transformation with parameters. To develop the 
example it will be useful to rewrite the relations (I29p from the perspective of the inverse 
Darboux transformations, that is, 

&n +2 




X 

O'n b n T S n _i 


it, i 

Tn "7 5 ^^"n+1" 


(7 n 


r n +2 




r n +1 

i ■ 

&n+1 


n ^ 0. 


(41) 


Example 4.4. Darboux transformation with parameters for i given by a = —1, (3 = 2, 

£(z) = 2 - z - z- 1 = (1 - z)( 1 - W 1 ), 
and V with constant positive Schur parameters b n = b e (0,1) for n ^ 1, 


( 


V = 


\ 


V 


—b <j 

—ab —b 2 —ab a 2 
a 2 ab —b 2 ab 

— ab —b 2 —ab a 2 

a 2 ab —b 2 ab 

—ab —b 2 —ab a 2 

a 2 ab —b 2 ab ^ 


a 


= VT^¥. 


For simplicity, we will study only the solutions with Sx = 0, a choice which fixes 
two of the four real parameters describing the solutions. Then, the relations (l37|) giving 
inductively the coefficients of the matrix A can be written as 


a 


s 0 =-(1 - b ), 

T'\ 

6’n 0, 


a 


a 


-2(1 + b) - s 0 2 2(1 + b) -y(1 ~ b) - 


a 


„2 > 


<7 


r n+ 2 = 


2(1 + V)^rl 


tn—1 — 


a 


r n +i 


n ^ 1. 


(42) 
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These relations have been expressed using rq, r 2 , instead of ro, r i, as the remaining two 
free parameters. This change is suggested by the role of rf, r\ as initial conditions of 
two identical recurrence relations, namely 


%n +1 


a 


2(1 + b 2 ) — x r 


x n = 


r 2 

' 2n+l> 


or 


n ^ 0. 


2ri+2 > 


(43) 


The recurrence relation with initial condition rj (r|) yields the coefficients r n , hence 
also t n —2 = — cr 2 /r n , for any odd (even) index n. 

Among the possible initial conditions Xo = > 0, those generating solutions of 

the Darboux transformation with parameters are characterized by the fact that (]42]) 
gives x n = 'f 2 n+i) r 2 n +2 > 0 for n ^ 0, as well as by the condition 

£(l-6) 2 + £<2(l + 6) (44) 

'1 '2 

guaranteeing that rj] > 0. 

Due to the positivity requirement of x n , the initial condition must satisfy 0 < Xo < 
2(1 + b 2 ) = £. The recurrence relation (1451) has two fixed points, £+ = (1 + b) 2 < £, 
which provide its constant positive solutions x n = £+. As for the non-constant solutions 
x n , a quick analysis of their positivity follows from the interpretation of (145]) as the 
implementation of Newton’s method to the function h given by 


h{x) 


X — | 46 


The typical behaviour of h is represented in Figure [0 ft is a non-negative concave 
function with a zero at £_, a divergence at £ + and a local minimum at £ = 2(1 + b 2 ). 
This guarantees that x n converges monotonically to £_ whenever x Q < £ + , being strictly 
increasing if x 0 < £_ and strictly decreasing if £_ < x 0 < £ + . On the other hand, every 
xo > £ + leads eventually to x n > £ for some n, thus no such a choice yields a positive 
solution of (145]) . Therefore, the positive solutions x n of (1451) are those associated with 
an initial condition xo e (0,£+], and, up to the fixed point xq = £+, all these solutions 
converge monotonically to £_. 

From these results, and bearing in mind the restriction (144]) . we conclude that the 
solutions of the Darboux transformation with parameters corresponding to Si = 0 are 
given by (142]) for any choice of r\, r -2 with the constraints 


0 < rq, r 2 < 1 + b, 


1 - b 1 + b 2 
r\ + r\ (1 — b ) 2 


(45) 


Let us discuss now the restrictions imposed on these solutions by the hermiticity 
and CMV conditions. According to (158]) . the solutions associated with an Hermitian 
linear functional are characterized by 

7o = 2(l-|-a), a = b --— 2 —-• (46) 

r i 
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Figure 1: The function h(x) whose Newton’s algorithm yields the recurrence for the 
sequence x n given by (03]). It is a non-negative concave function with a zero at £_ = 
(1 — b) 2 , a divergence at £ + = (1 + b) 2 and a local minimum at £ = 2(1 + b 2 ). 


Taking into account the expression of tq in terms of r\, ry given in (021) . the hermiticity 
condition becomes r\ = r 2 . In view of (05]h the solutions of the Darboux transformations 
leading to Hermitian functionals are given by (021) with 

r i = r 2 = t e (1 — b, 1 + b], 


and the related value of r 0 is 

r l = 2(1 + b) (l 


(1 ~ i>) 2 \ 
r 2 J 


The above solutions include the CMV ones which, according to (159]) . are character¬ 
ized by the additional conditions |a| < 1 and rp = r 0 c r, p = Vl — a 2 . Using (|46|) we 
obtain 

|a| < 1 <=> r* > 1 — 6, rp = r 0 a o r = 1 + b. 

Therefore, the only CMV solution with si = 0 is that one obtained from (021) with 

r\ = r 2 = 1 + b , 


which corresponds to constant coefficients r n = 1 + b for n ^ 1, associated with the 
fixed point £+ of (03j) . 

From (1460 . (021) and (011) we hnd the corresponding Schur parameters 

r 3b- 1 
a n = < 1 + b ’ 
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n = 1, 
n ^ 2, 










which provide the referred CMV solution C = ADA 1 of the Darboux transformation 
with parameters, 


C 



p 

—ab —ab 


(T 


2 


pa 


V 


a a —b 2 ab 

—ab —b 2 —ab a 2 
a 2 ab —b 2 ab 

—ab —b 2 —ab a 2 
a 2 ab —b 2 ab 


\ 


...J 


a = 


36-1 

1 + 6 ’ 


p = Vl — a 2 


2 

1 + 6 


6 ). 


The matrix A, coming from a reversed Cholesky factorization £{D) = A + A, and giving 
the Cholesky factorization £(C) = AA + , has the form 


A = 


( ro 

SO r 
t 0 
t 




J 


+ 1 + 6 ’ 
r = 1 + b, 


1 - 6 
s ” “ - a TTt' 

t = -( 1 - 6 ). 


The CMV matrix D is related to an absolutely continuous measure v supported on 
the arc T = {e ld : \ sin || ^ 6} given by 


J sin 2 | - 6 2 

du(e id ) = w(6) dO, w(6) = - T . -, e id s T. 

I Sm 2 I 


Since £(z) = (z — — 1) has a single zero at z = 1, the above CMV solution 

C of the Darboux transformation with parameters must be associated with a measure 
dp = du/£ + mS i for some m ^ 0. 

To analyze the value of the mass m, let us have a look at the Schur function / 
of p. The Schur function / has an analytic continuation through the essential gap 
{e l6 : | sin 11 < 6}, and z = 1 is a mass point of p iff it is a solution of zf(z ) = 1, 
i.e. /(1) = 1. On the other hand, Geronimus’ theorem asserts that the application of 
the Schur algorithm to / generates the sequence (—a, —6, —6, —6,...). Analogously, the 
Schur function g of v is characterized by the constant sequence (—6, —b, —6,...) arising 
from the Schur algorithm. Therefore, g is obtained from / after a single step of the 
Schur algorithm, 


9(z) 


1 f{z) + a 
z 1 + af(z) 


This relation implies that /(1) = 1 iff g(l) = 1, which is not possible because 1 is not 
a mass point of u. We conclude that m = 0 and 


dp{e ie ) 


w(9 ) 
e ie - 1| 2 


d6 


w(9 ) 

2 sin 2 | 


d6, 


e ie e T. 


In other words, the arc T is the common spectrum of C and T?, which are isospectral. 
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The spurious solutions can be also explicitly described. For instance, the choice 
Si = 0, ri = 1 — b, r 2 = 1 + b satisfies (l45|) and yields s n = 0, r 2n _ 1 = D, r 2n = r 2 for 
n ^ 1 , so that 


A 


/ r 0 


r l 

r i 0 V 2 
—r 2 0 ri 

—ri 0 


V2 



r 0 = 2 Vb, s 0 = -a, 

ri = 1 — b, r 2 = 1 + b. 


Since r\ A r 2 , the corresponding solution C = ADA 1 is related to a non-Hermitian 
functional. Explicitly, 


C 


t ^T=6 




—ab 


1 +b 


-fe 1 -& 2 




( 1 - 6)2 

0 

.2 




ib^+Ab-l 


~& 2 -<*£S 


<rb 1 + 4b ~* -b 2 <r&±±fc 

( 1 - 6)2 1-6 


which, solving Cy = zx, yields an associated zig-zag basis y with 


Xo = 1, 


1 — b 

Xl = - - 7r( z ~ !)) 


2 o\fb 


X2 = 


2\Jb(l - b) 


(i bz + 3b — 1 + z ), 


Hence, the conditions u[ y n yo*] = <V,o defining the associated functional u give 


w[l] = 1 = u[z \, u[z x ] = 1 — 46 7 ^ u\_z\, 


which show directly the lack of hermiticity of u. 


5 Jacobi versus CMV 

So far, we have defined the Darboux transformations for CMV matrices by analogy with 
the Darboux transformations for Jacobi matrices. Nevertheless, this does not mean 
necessarily that Darboux for Jacobi and CMV share all their properties. An explicit 
comparison between these two versions of Darboux is necessary to understand to which 
extent the uses of Darboux for Jacobi could be exported to CMV. This comparison 
should highlight the similarities and differences between these two transformations, 
showing also the direct links between them if any. These are the objectives of the present 
section. Thus, we will first review the main features of Darboux for Jacobi, translating 
the standard approach based on LU factorizations into an equivalent one which uses 
Cholesky factorizations for a better comparison with Darboux for CMV. This review 
will be used simultaneously to exhibit the analogies and differences between Darboux 
for Jacobi and CMV. Besides, the classical Szego connection [76] between orthogonal 
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polynomials on the real line and the unit circle will provide a direct link between both 
transformations. This not only supports the present version of Darboux for CMV as 
the natural unitary analogue of Darboux for Jacobi, but also serves as a communicating 
channel between both transformations and their applications. We will also show the 
role of the Darboux transformations in a more recent connection between the real line 
and the unit circle due to Derevyagin, Vinet and Zhedanov [20], 

The standard procedure for the Darboux transformation of a Jacobi matrix 

uses LU instead of Cholesky factorizations. Actually, the usual starting point is not the 
Jacobi matrix itself, but the tridiagonal one 

5= ( + +; i ) ( 47 ) 


obtained by conjugating J with a positive diagonal matrix Lly, i.e. J = LlyJTIj 1 . 
The Darboux transformation starts by choosing /3 e R. such that the LU factorization 
J + j3I = LU is available, and then generates via the identity 1C + (31 = UL a new 
tridiagonal matrix 1C with the shape (l47j) . except that the entries of its lower diagonal 
can be signed. When such a diagonal is positive, /C can be symmetrized by conjugation 
with a positive diagonal matrix LU, leading to a Jacobi matrix 1C = II^/CLU. 

Note that the LU factorization J + (31 = LU exists simultaneously to that of 
J + (31 , whose LU factors are L = II jj- 1 Lily and U = LL^LTIy. Therefore, both LU 
factorizations are possible whenever J + (31 is positive definite. We are going to see 
that in this case the Darboux transformations of Jacobi matrices can be understood in 
terms of Cholesky factorizations, similarly to the developed Darboux transformations 
for CMV matrices. This will make it easier the comparison between Darboux for Jacobi 
and CMV. 

The matrices involved in the above discussion are determined by two real sequences 
v n and u n defined by the lower and main diagonals of the factors L and U, respectively, 



(48) 


More precisely, J, 1C and their symetrizing matrices IIy, Ll^ are given by 


J +(31 = 


( Ml 1 

V\U\ V\+U2 1 

V2U2 V2 + U3 1 


jc +pi 


( Ml+Ml 1 

VlU 2 V2 + U2 1 

V2U3 V3+UZ 




U Z K = —n \U 

Ui 


yu 0 , 


U 0 = 



U 2 

U 3 



?Tn = Y\ U k V k- 
k=l 


( 49 ) 
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By hypothesis, J is symmetrizable, i.e. u n v n > 0 for all n. Then, /C is also symmetriz- 
ablc iff v n u n+ i > 0 for all n, which, bearing in mind the previous hypothesis, means 
that the sequences u n and v n have the same constant sign. This is the case when J + ft I 
is positive definite, which is equivalent to state that u n , v n > 0 for all n and, thus, also 
to the positive definiteness of 1C + f3I. 

It is known that LU and Cholesky factorizations of positive definite matrices are 
closely related. If J + j3I is positive definite, its LU factors provide the Cholesky 
factorization 

J + pi = AA + , A = LU ]' 2 , A+ = U~ l/2 U , (50) 

1 /9 

where U 0 is the positive square root of the matrix Uq given in ()49lh Then, using (1491) 
and (150|) we get 

K. + PI = n^f/ZflK = U- l UjULU^Uic = Uq 1/2 U LUl' 2 = A + A. 

We conclude that the standard Darboux transformations relating Jacobi matrices 
J and K, can be rewritten in the following way: take a real polynomial p(x) = x + /3 
such that p(J) is positive definite. Then, a new Jacobi matrix /C is defined resorting 
to the Cholesky factorization p(J) = AA + , A e and reversing it as p(/C) = A + A. 
The above procedure can be directly generalized to an arbitrary real polynomial p(x) = 
ax + (5 of degree one such that p(J) is positive definite, leading to a triangular matrix 
A with the 2-band structure 



whose lower diagonal has non-zero entries with the same sign as a. This defines the 
Darboux transformation J ^ } C. 

Jacobi matrices encode the three-term recurrence relation of orthogonal polynomials 
with respect to a positive Borel measure supported on an infinite subset of the real line 
(‘measure on the real line’ in short). Analogously to the CMV case, this links Jacobi 
matrices and positive definite linear functionals in the space of real polynomials. More 
generally, tridiagonal matrices with non-null entries in the lower and upper diagonals are 
related to polynomials which are orthogonal with respect to a quasi-dehnite linear func¬ 
tional not necessarily associated with a positive measure on the real line. An advantage 
of defining Darboux transformations via LU factorizations is that they apply directly to 
the general quasi-dehnite case. Although the Cholesky approach to Darboux transfor¬ 
mations previously described has a priori the drawback of being applicable only to the 
positive definite case, Section 18.21 will show how to deal with Darboux transformations 
relating quasi-dehnite functionals by using generalized Cholesky factorizations. 

Analogously to the CMV case, the Darboux transformations of Jacobi matrices 
based on Cholesky factorizations can be embedded into a general procedure for the 
factorization of a real polynomial p evaluated on a Jacobi matrix J. A key role is 
played by the basis p = (po,Pi,P 2 , ■ ■ ■ Y °f the real vector space P = span{x n }* =0 = M[x] 
of real polynomials defined up to a constant factor by 

xp = Jp. 
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These polynomials, which we will assume with positive leading coefficient in what fol¬ 
lows, are orthonormal with respect to a positive Borel measure on the real line, or 
equivalently, with respect to a positive de fini te linear functional on P. Given any other 
basis q = (q 0 , qi, q 2 , ... of orthonormal polynomials related to a Jacobi matrix 1C, the 
matrix A e and the lower Hessenberg type one B defined by 


P = M, PQ = Bp, (51) 

satisfy 

p(J) = AB , p(K) = BA, JA = AK, BJ = KB. 

The result is valid for any arbitrary real polynomial p of any degree, and the proof 
is a direct translation of that of Proposition 12.101 As in the CMV case, the factor A 
plays the role of a change of basis relating the Jacobi matrices by conjugation since 
/C = A~ X J A and J = A1CA~ l . Nevertheless, in contrast to the CMV case, if p has 
degree one, the relations p(J) = AB and p(IC) = BA are enough to generate one of 
the matrices J, K. starting from the other one. 

When p has degree one, p(J) is tridiagonal and B is strictly lower Hessenberg. 
The Darboux transformation amounts to choosing A as the left factor of the Cholesky 
factorization of p(J) when it is positive definite, so that A is 2-band with positive 
entries and B = A + . Furthermore, Theorem 13.21 has an exact analogue for Jacobi 
matrices whose proof follows from similar arguments. 


Theorem 5.1. Let u, v be positive definite functionals in P with orthonormal polyno¬ 
mials p, q and Jacobi matrices J, K respectively. Then, the following statements are 
equivalent: 

(i) J K for a real polynomial p of degree one, i.e. p(J) is positive definite and 
1C = A~ l JA, where A e AT comes from the Cholesky factorization p(J) = AA + . 

(ii) v = up for a real polynomial p of degree one. 

(Hi) p n e span{g n _!, q n } for n ^ 1 and p n f span{g n } for some n. 

(iv) There is a real polynomial p f spanlpx} of degree one such that pq n e span{p„,p n+1 } 
for n ^ 0. 

The polynomials p mentioned in (■ i ) and (ii) coincide up to a constant positive factor, 
and they also coincide with that one in (iv) up to a constant real factor. If p coincides 
exactly in (i) and (ii) then p = Aq and pq = A + p without any rescaling. 

In particular, the transformation J ^ 1C is equivalent to multiplying by (a positive 
rescaling of) p in terms of orthogonality measures, which is known as a Christoffel trans¬ 
formation on the real line. As a consequence, the Darboux transformations of Jacobi 
matrices are almost isospectral since they preserve the spectrum up to the elimination 
of, at most, a single eigenvalue. This is similar to what happens to Darboux for CMV, 
although these transformations may delete two eigenvalues. 
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The only spectral change caused by the corresponding inverse transformations for 
Jacobi matrices is therefore the possible addition of an eigenvalue. The inverse Dar- 
boux transformations of Jacobi matrices are the matrix realization of the Geronimus 
transformations for measures on the real line, which divide a measure by a polynomial 
p of degree one and add a Dirac delta at the zero of p. On the other hand, in the 
language of Cholesky factorizations, the Darboux transformations with parameters for 
a Jacobi matrix fC start with a reversed factorization p(IC) = A + A, A s A7 , and then 
generate a new Jacobi matrix J by either the relation p(J) = AA + or equivalently the 
conjugation J = AK.A _1 . 

At this point, a major difference appears concerning the comparison between Dar¬ 
boux for CMV and Jacobi matrices: if p is of degree one, the freedom in the reversed 
Cholesky factorization p(IC) = A + A is encoded by a single real parameter, the (0,0) 
entry of A. This coincides with the number of free parameters in the Geronimus trans¬ 
formations on the real line, which are parametrized by the mass of the Dirac delta at 
the single zero of p. Actually, contrary to the CMV case, every solution of the Jacobi 
version for the Darboux transformations with parameters corresponds to a solution of 
the corresponding inverse Darboux transformations. In other words, the Darboux trans¬ 
formations with parameters for Jacobi matrices present no spurious solutions, so that 
these transformations become an exact matrix realization of the Geronimus transfor¬ 
mations for measures on the real line. This is in striking contrast with the CMV version 
of Darboux transformations with parameters. 

Due to the absence of spurious solutions, the Jacobi version of Darboux transforma¬ 
tions only involves band matrices and their inverses. Therefore, in view of Proposition 
12.31 in the Jacobi case the associativity of matrix multiplications holds automatically 
for all the Darboux relations which exclude such inverses. This will be used in what 
follows to omit discussing associativity issues when dealing with relations involving only 
band matrices. 

Despite the differences above mentioned, we have found many analogies between 
Jacobi and CMV which justify the name ‘Darboux’ for the transformations of CMV 
matrices introduced in this paper. This similarity can be further highlighted by resorting 
to the Szego connection between orthogonal polynomials on the real line and the unit 
circle ug. This connection will be used in next section for a closer comparison between 
Darboux for Jacobi and CMV, but it can eventually be used to exchange ideas, results, 
techniques and applications between these two versions of Darboux. 

5.1 The Szego connection 

The aim of this section is to show that the Szego connection [76j maps exactly Darboux 
for Jacobi into Darboux for CMV as defined in this paper. As a byproduct, we will 
obtain a direct relation between the Darboux factors for Jacobi and CMV. With this in 
mind, we will first review a matrix approach to the Szego connection due to Killip and 
Nenciu [5Tj (see also uni) which serves our interests. 

The Szego connection is based on the map z >-> x = z + z~*, which transforms the 
unit circle onto the interval [—2, 2] mapping conjugated points onto the same image. 
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This induces the Szego projection /j Sz(/i), a one-to-one correspondence between 
symmetric measures /i on the unit circle (i.e., measures which are invariant under con¬ 
jugation z —> z) and measures Sz(/i) on [—2, 2], which is given by 

J P(x ) d(Sz(yu))(x) = J P(z + W 1 ) dp(z), VP e P. 

The symmetry of p means that it has orthonormal polynomials p n with real coefficients, 
i.e. the corresponding Schur parameters a n are real. Then, the relations 


Je) frl = Z "i&niz) + V2n(z)) 

rn V**'/ /TT7Z -r 

^ 2(1 + a2n) x = z + z~ 1 ( 52) 

(o) / X = Z n (^ n (z) - <P 2 n{z)) 

1 V 2 ! 1 ~ « 2 r ,.)(^ _1 - Z ) ’ 

provide two sequences of polynomials in the variable x = z + z~ l , 

p (k) = ? pp k = e, o. 

We use the convention a 0 = 1 so that p$\x) = Pa(z). The sequences p® and p® are 
orthonormal with respect to the measures on [—2,2] given respectively by 


p e = Sz(/r), dp 0 {x) = (4 - x 2 ) dp e (x), 


Due to the orthonormality with respect to a measure on the real line, there exist Jacobi 
matrices J e , J Ql such that 

xp ^ = JkP ^, k = e,o. (53) 


The relation between the measures p e and p a implies that J e and J Q are connected by 

a Darboux transformation J e > -> J 0 generated by a polynomial of degree two (see 

Section 18.11 for the generalization of Darboux transformations to polynomials of higher 
degree). 

For convenience, we will work with the matrix functions 


Pn 


(Pn 0\ 

VO Pn) ’ 


Pn(z) = P^\x), 

Pn(z) = (Z~ l - Z)p { °\x ), 


x = z + z \ (54) 


which should be considered as functions of z instead of x due to the presence of the 
factor z— z^ 1 in p n . Bearing in mind (154|) . the recurrence relations (j53j) can be rewritten 
in terms of p = (p 0 ,pi,p 2 ,... )* and p = (p 0 ,Pi,P 2 , ■ ■ • Y as 

xp(z) = J e p{z ), xp(z) = JoP(z), x = Z + Z~ X . 

Equivalently, using the notation p = (p Q ,Pi,p 2 , ■■■)*, 

xp(z) = Jp{z), J = Je®Jo , X = z + z~ 1 , (55) 
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with J e ( J 0 ) acting on even (odd) indices. More explicitly, J is the 2 x 2-block Jacobi 
matrix with entries 


2i,2j (Ve)* 


JJ 


2i+l,2j+l 


•3i 


J 


2i,2j+l 


= J 


2i+l,2j 


= 0 , 


and can be understood as the block Jacobi matrix associated with the diagonal matrix 
of measures 


h = 


he 

0 



On the other hand, allows us to rewrite p n and p n in terms of the orthonormal 
Laurent polynomials Xn associated with p, 


X2 n + X2 n* 

y/2(l + 02n) 


X2n X2n* 
y/2(l ^ Cl2n) 


(56) 


If C is the CMV matrix related to p, we denote p = Sz (/ j ), p = Sz(y), J = Sz(C), 
referring to them as the matrix Szego projections of p, y, C , respectively, in contrast to 
the scalar Szego projections p e = Sz (p), p ^ = Sz (p), J e = Sz(C). 

An explicit expression of the matrix Szego projection CT = Sz(C) follows by inserting 
(l25jl into (1561) . which yields 


Pn = 


P2nX2n-l + (1 + «2n)X2n _ 1 

-\/2(l + a 2n) V2 


(Vl “ Ci2nX2n-l + Vl + &2nX2n), 


~ P2nX2n-l ~ (l — 0>2n)X2n 1 , r7 ~;- , n - \ 

Pn -1 — - , =- — “7=( — VI + O 2nX2n-l + VI — a 2n,\'2n.) > 

V 2(1 - «2n) V2 


or, in matrix form, 


Pn -1 


1 /—VI + a 2n VI — ®2n | / X2n- 


hn / V2 \ Vl a 2n Vl + a 2n J \ X2n 

The above relation can be compactly written as 

1 


P 


= S X , s = 


S 2 


S 4 


s n = 


1 f ~ \A + a n Vl" 


\/2 \ -\/l H - d r 


(57) 


in terms of a block diagonal symmetric unitary matrix S. Combining (l57j) with (l55l) 
and (EJ) we get 

JS X = Jp(\) = (z + z~ x )p ({) = {z + z~ x )Sx = S(C + C + ) X , 

which, bearing in mind that S 2 = /, leads to the following explicit expression for the 
matrix Szego projection of a CMV matrix C, 


Sz (C) = J = S(C + C + )S. 


(58) 
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That is, the matrix Szego projection of C stems not only from the evaluation of the 
Szego map z + z~ 1 on C, as one would naively expect, but also from a subsequent 
conjugation with the orthogonal matrix S. We will refer to the matrix S , given in (157]) 
in terms of the Schur parameters a n of C, as the Szego rotation for the CMV matrix C. 

The Szego connection is restricted to symmetric measures on the unit circle, i.e. to 
real CMV matrices because symmetric measures are characterized by having real Schur 
parameters. Therefore, in order to relate Darboux transformations on the unit circle and 
the real line through the Szego connection we should consider a Darboux transformation 
C ^ T> between two real CMV matrices C, D. This means that du = £ dp for certain 
symmetric measures p, v on the unit circle related to C, T>, respectively. If £(z) = 
az + f3 + az~ l , the symmetry requirement for the measures implies that £(z -1 ) = £(z), 
thus a s M. Then, we have the following result which yields the announced Szego 
connection between Darboux for Jacobi and CMV. 

£ 

Theorem 5.2. Let C <—>■ V be a Darboux transformation between real CMV matrices C, 
V with Laurent polynomial £(z) = az + /3 + az~ 1 , Then, the Szego projections 

J e = Sz(C) , IC e = Sz('ZJ), 3 = Sz (C) = 3e © 3 0 , = Sz(X>) = /C e © /C 0 provide Jacobi 

matrices related by the Darboux transformations 3 e 3o with polynomial 

p(x) = ax + (3. The corresponding Cholesky factorizations 

p(3k) k e, o, 

yield the following ones for the matriz Szego projections 

p{J) = AA + , p(K) = A + A, A = A e ® A a , 

where the direct sum A = A e © A a is such that A e (A a ) acts on even (odd) indices, so 
that A has the structure 

(3 \ 

A = © o + 

I © o + 

V .7 

The relation between the Cholesky factorizations for the CMV matrices, 

£{C) = AA + , i(V) = A + A, 
and those of their Szego projections is given by 

A = SAT, A = SAT, 

where S and T are the Szego rotations for C and V respectively. 

Proof. If (i is a measure for C, the relation C *-» T> implies that du = £dfi is associated 
with V. Since p(x) = £(z) under the Szego mapping x = z + z^ 1 , the Szego projections 
p e = Sz (dp), v e = Sz (dv) are related by dv e = pdp e . A similar relation is obviously 
valid for dfx 0 {x) = (4 — x 2 ) dp, e (x) and dv 0 (x) = (4 — x 2 )dv e (x). This proves the 
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Darboux relations Jk K,k for k = e, o. The Cholesky factorizations of J and K arise 
from regrouping those of J}. and K.k for k = o, e. Concerning the relation between the 
Darboux factors A and A, let us consider p = Sz(\'), q = Sz(u;), where y, u are the 
orthonormal Laurent polynomials with respect to /i, v respectively. From Theorems 13.21 
and 15. II we know that x — khn and p = Aq. Combining these equalities with (l57jl and 
the analogous relation q (]) = Ten, we get SAoj = S\ = p{ 1) = Aq(\) = ATuj , 
so that SA = AT. Since S 2 = T 2 = I we can rewrite this identity as A = SAT or 
A = SAT. □ 

The previous result is in agreement with [26], which shows that, in the case of 
symmetric measures, the linear spectral transformations associated with the Christoffel 
and Geronimus transformations on the unit circle are projected by the Szego mapping 
onto similar transformations on the real line. However, we have seen that there exist 
Darboux transformations of CMV matrices which do not correspond to Christoffel or 
Geronimus on the unit circle, namely, those associated with a Laurent polynomial £(z) = 
az + (3 + olz _1 , a,PeR, whose zeros lie on the unit circle, i.e. \f3\ ^ 2\a\. In other 
words, there exist Christoffel and Geronimus transformations on the real line which are 
not the Szego projection of a Christoffel or Geronimus transformation on the unit circle, 
but correspond via the Szego mapping to general Laurent polynomial modifications of 
degree one on the unit circle. 

The close relationship between Darboux for Jacobi and CMV is highlighted by the 
simple relation between the CMV Darboux factor A and its matrix Szego projection 
A = Sz(H) given in Theorem 15.21 This theorem also identifies A as the Darboux factor 
for the Darboux transformation 7C, which is consistent with the relation (]58j) since 
it implies that 

p(J) = S£(C)S = SAA+S = AA + , p{K) = Ti{V)T = TA+AT = A + A. 


We will illustrate the Szego connection between Darboux for Jacobi and CMV with 
an explicit example. 

Example 5.3. The Szego connection and the Darboux transformation for 


£(z) = (z + a)(z 1 + a) = a(z + z ) + 1 + a", a e (—1,1), 


and C with Schur parameters aq = a and a n = 0 for n ^ 2, 

\ 


C = 


/-a p 
' 0 0 0 1 
pa 0 0 

0 0 0 1 
10 0 0 
0 0 0 1 
10 0 0 


p = VT 


a 2 . 


7 


It is known that C is associated with the measure 

, , ,0. 1 dO 1 dd 

Me > " JzTapZr = WTr 
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whose orthonormal polynomials are the Bernstein-Szego ones 


<Pn(z) 


p, n = 0, 

z n ~ 1 (z + a), n 3 1. 


Therefore, the corresponding orthonormal zig-zag basis is given by 


X 2 n-i(z) = z n l {z + a), 


X 2 n(z) 


P , n = 0, 

z~ n ( 1 + az ), n 3 1. 


The Darboux transformation C >-* T> is associated with the Christoffel transforma¬ 
tion // h-> d, where d'd(e l °) = (19/2% is the Lebesgue measure. Hence, V = S is the 
shift matrix given in (jHJ) corresponding to the Schur parameters b n = 0, and the related 
orthonormal zig-zag basis is 

U2n-l{z) = Z n , C02n(z) = Z~ n . 

The matrix A of the change of basis 

( P 

/ a 1 

X = Au, A = I a 0 1 

l a 0 1 



is the Darboux factor for the Cholesky fatorizations £(C) = AA + and £(T>) = A + A. 

Concerning the Szego connection, in this case the matrices S and T coincide because 
a> 2 n = b 2n = 0, thus 


q — t - 

^r. 77. 


-1 1 

V2 V 1 1 

The matrix Szego projection A = SAT of A 

( P \ 

n i ' 


A = 


p 

o i 

y/2 a 0 1 

a 0 1 

a 0 1 


— A e © A a , A e — 


( P 

y/2a 1 




\ 


a 1 
a 1 


A,= 


/ 


1 

a 1 
a 1 
a 1 


provides the factorizations p(J’) = AA + and p(f C) = A + A, where 

p(x) = ax + (1 + a 2 ), 

3 = S(C + C + )S is the block Jacobi matrix 

( -2a 0 V2p \ 


Je = 



0 —a 0 

1 





3 = 

\[2p 0 a 

0 

1 


— 3e © JToj 



1 0 

0 

0 

1 




1 

0 

0 

0 1 




V 



/ 



/ —2a y/2p 

\ 



( -a 1 

\ 

V2 p 

a 1 




10 1 



10 1 


> 

Jo = 

1 0 1 


V 

1 0 1 




1 0 1 

V 

7 


46 












and K = T(S + S + )T is obtained by setting a = 0 in Af. The orthonormal polynomials 
related to 

dd e (x) = —, dd 0 (x) = — V4 — x 2 dx, 

7Tv4 — X 2 7T 

are, np to normalization and rescaling of the variable, the first and second kind Cheby- 
shev polynomials, respectively, 



1 , n = 0 , 

j=(z n + z~ n ), n> 1 , 



1 z n - z~ n 
y/2 Z- Z- 1 ’ 


X = z + z 


while the orthonormal polynomials associated with d/j fc = ddk/p can be obtained from 
the relations p ® = A^q^, i.e. 


p, 

n = 0 

q[ e) + \/2 a, 

n = 1 

dn ] + aqn- 1 , 

n ^ 2 


1 /V 2 , n = 0 , 

(o) (o) 1 

Qn + (ig'n- l , W>1. 


5.2 The DVZ connection 

Christoffel transformations, and thus Darboux transformations, play a central role in 
a new connection between orthogonal polynomials on the unit circle and the real line 
recently discovered by Derevyagin, Vinet and Zhedanov [20] (in short, the DVZ connec¬ 
tion). In this section we will briefly describe the interrelations between the DVZ and 
Szego connections via Darboux transformations. For the details see |20j and also [10], 
where a different approach to the DVZ connection is presented. 

An important ingredient in these interrelations is the symmetrization process for 
orthogonal polynomials on the unit circle [36] [57] , which associates with any measure 
p on the unit circle the only one p which is invariant under z —> — z and satisfies 

^f(z)dp(z) = jf(z 2 )dp(z), V/e A. 

2i_^ ^ 

We will express this relation as p F -> p. The corresponding orthonormal polynomials, 

tp n and <p n , are related by 

$2n{z) = $2n+l{z) = Zlf n {z 2 ), (59) 

2i_> 2 ^ ^ 

so that the induced transformation of CMV matrices C *-——>■ C amounts to the trans- 
formation of Schur parameters (a n ) n ^i F——» (a n ) n ^i = (0, ap, 0, 02 , 0, 03 ,...). 

Since z z 2 reads as x x 2 — 2 if x = z + z -1 , the combination of the Szego 
projection and the above symmetrization process maps the measure p e = Sz (p) onto 
the only one p e = Sz (JT) on [—2,2] which is symmetric under x —» — x and satisfies 

J P(x) dp e (x) = J P(x 2 — 2) dp e (x), VP € P. 
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We will write p f 


x*—>x z —2 


p e . The orthonormal polynomials AA of p e are given by 
An 0 ) = P ( n e) (x 2 -2), P2n+l( X ) = X ~ 2 )> ( 60 ) 


where pffl and qff are orthonormal with respect to p e and its Christoffel transform 
(x + 2)dp e (x), respectively [ID] . The Jacobi matrices J e and /C e of p^A and qA are 
related by the Darboux transformation J e /C e . Since J e = Sz(C), we conclude that 
lC e = Sz (T>) with D the CMV matrix given by the Darboux transformation C AV— 

T>. The relations (l60l) imply that the Jacobi matrix J e of pA satisfies 

J 2 — 21 = J e © /C e , 


where J e and fC e act on even and odd indices respectively. We will express the above 

x~\~2 X 1 —> tc ^—2 ^ 

connections by writing /C e *—- J e > J ei the double line in the right arrow indi¬ 

cating that both, J e and /C e , are involved in J e . 

The DVZ connection starts from a known factorization of CMV matrices into block- 
diagonal unitary factors HU, 


C = MC, C 






Here a n are the Schur parameters of C and, as previously, we use the convention do = 1. 
When these parameters are real, a Jacobi matrix fC is defined by 


K = C + M 


( ao—ai pi 

pi ai— a ,2 P 2 

P2 ffi2—Q3 P3 



(61) 


This establishes the DVZ connection C > ^ V - Z > 1C. Derevyagin, Vinet and Zhedanov 
identify in [20] the measure and orthonormal polynomials of 1C in terms of those of C. 
This identification appears surprisingly when combining a Christoffel transformation on 
the real line with a connection between measures on the unit circle and on the real line 
due to Delsarte and Genin 031- A more direct approach to this problem can be found in 
[TO] , which also uncovers the role of the symmetrization process and the Szego mapping 
in the DVZ connection. If p and ip n are the measure and orthonormal polynomials 
related to C, then 


(x + 2) dp e (x), 


q n (x ) 


3 n (^n(- 2 ) + Ztpnjz 2 )) 
V2(l + z ) 


X = z + z 


are the measure and orthonormal polynomials associated with 1C. Here, p e = Sz(/i) 
is the Szego projection of the symetrization p of p. It turns out that 1C = Sz (T>) is 
the Szego projection of the CMV matrix T> arising from the Darboux transformation 
C ——> T>, where C comes from the symmetrization C ' " - A - > C. 
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These results can be summarized in the following commutative diagram, where Dar- 
boux transformations play a prominent role. It shows explicitly that DVZ follows from 
a mix of symmetrization, Szego and Darboux. 



6 Darboux versus QR 

Prior to Cholesky factorizations, other matrix factorizations have been related to Lau¬ 
rent polynomial modifications of measures on the unit circle. In the same work [8Tj 
that introduces systematically for the first time the CMV matrices, D. Watkins links 
Christoffel transformations on the unit circle to QR factorizations of shifted CMV ma¬ 
trices. This can be seen as the unitary analogue of a similar connection between Jacobi 
matrices and polynomial modifications of measures on the real line first established by 
Kautsky and Golub [50] (see also @01221 EH). A version of this result for Hessenberg 
matrices related to orthogonal polynomials on the unit circle appears in |16j, 251 02j. 
We will describe the QR interpretation of Christoffel transformations on the unit circle 
based on CMV matrices to compare with the Darboux transformations developed in 
the present paper. 

Given a measure g on the unit circle, consider a general Christoffel transformation 

i 

g <->■ v, i.e. 

Z = q*q, q(z) = z -£, |C| < 1- (62) 

Let C, V be the CMV matrices related to g, u, respectively, and y, u the corresponding 

orthonormal zig-zag bases. We know that x = Au with A the 3-band lower triangular 

€ 

factor appearing in the Darboux transformation C V. 

Since dv{z) = \z — Q 2 dg(z), we have that (1 — KDV ^ v ^ (1 + |C|) 2 h- Thus, 
the sets L 2 and L 2 of square integrable functions are identical as topological spaces, 
although they have different Hilbert structures given by the inner products 

(f, g)» = J fg dfi, </, g\ = J fg dv. 

In consequence, the following mappings between Hilbert spaces define bounded linear 
operators with a bounded inverse, 

A: Ll - Ll U: - L\ UA: L\ - L 2 AU : L 2 - L 2 

/ ■—/ f^q*f f^>q*f f *-*q* f 

Indeed, U is unitary because (U/, Ug) M = {f,g}u for every f,g e L 2 U and ran(17) = L 2 . 

To generate matrix representations of these operators, we will choose the zig-zag 
bases x an d w, respectively, as orthonormal bases for the Hilbert spaces L 2 and L 2 U 
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connected by such operators. Since (A\' 0 , Ayq,... Y = x = Aw, the corresponding 
matrix for the operator A is simply the Darboux factor A, so that A reads in coordinates 
as X h-» Y = XA. Here A" and Y stand for row vectors of coordinates with respect to 
X and u respectively, i.e. / = Xx and A f = f = Yu. Analogously, q*(C) = (C — ( I) + 
and q*(T>) = (T> — (I) + are the matrices of the operators UA and AU respectively. 

Contrary to the previous discussions, in this context we do not need to worry about 
the associativity of matrix products because it is guaranteed by the associativity of 
compositions of operators in Hilbert spaces. Likewise, A has a unique bounded inverse, 
that one representing the bounded operator A -1 in the bases u and y. Hence, the linear 
operator X —> XA has a trivial kernel in the space .fj of square-sunmiable sequences. 
This result is not in contradiction with Proposition 12.111 which refers to ker^(i?) = 
ker^(H + ) = keri(A) + , since this formal kernel can have no non-trivial square-summable 
vectors. 

Let U be the matrix of the unitary operator U, i.e. the unitary matrix defined by 
the relation 

q*u = Ux, 

Since UA and AU are obtained by composition of A and U, their matrices are products 
of A and U, 

(C - (I) + X = q*X = q*Au = AUu => (C - (I) + = AU , 

(V - C I) + u = q*u = Ux = UAu =► (V - (I) + = UA. 

Therefore, 

q{C) = C - £1 = QR, q{V) = V-£I = RQ, Q = U + , R = A + , 


with Q unitary and R upper triangular. In other words, q[C) and q{V ) are connected 
by a QR factorization and a reversed one, much in the same way as £(C) = AA + and 
£(T>) = A + A are connected by Cholesky factorizations. Moreover, these two kinds of 
factorizations are closely related because R is the adjoint of the Darboux factor A. 
Actually, the Cholesky factorizations are a direct consequence of the QR ones because, 
due to the unitarity of Q, 

q(C ) = QR => 1{C) = q*(C)q(C) = R + R, 
q(V) = RQ => £(C) = q{V)q*{V) = RR + . 


Like the upper triangular factor R, which is 3-band, the unitary factor Q has a 
remarkable structure. Bearing in mind that R = A + has a bounded inverse, we find 
that Q = (C — (I)R- 1 . This implies that Q has an upper CMV structure, i.e. it is 
upper Hessenberg type with the shape 


Q = 




(63) 
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So far, we have described a QR interpretation of Christoffel transformations on 
the unit circle based on the use of CMV matrices. This result was first obtained by 
D.Watkins in (ST] using a different approach. Indeed, he proves that the transformations 
dg(x) —» \x — /3\ 2 dg(x), j3 e R, on the real line have a similar QR interpretation in terms 
of Jacobi matrices (see also |6, UED])- Therefore, QR suggests that the unit circle version 
of such transformations on the real line should be the Christoffel transformations on 
the unit circle. On the contrary, the Darboux transformations developed in the present 
paper highlight the role of the Christoffel transformations on the unit circle as the 
counterpart of the Christoffel transformations dg(x) —> (x — /3) dg(x) on the real line. 

As in the QR case, the Darboux transformation associated with a Christoffel trans- 
formation fi <—>■ u can be interpreted in terms of factorizations of operators in Hilbert 
spaces. To show this, let us introduce the following bounded linear operators with a 
bounded inverse 


BA:Lj-Lj AB:L*-L* 

/ ~ if / — If 

The equality (Bf, g)^ = (f,hg\ for every / e L 2 and g e L 2 ^ identifies B = A + as 
the adjoint operator of A. Therefore, A + is the matrix of B with respect to the bases 
X of L 2 and u of L 2 , so that we recover the identity £oj = (Boj 0i Bwi,... = A + x- 

The corresponding matrices of BA and AB are £{C) and £(T>), respectively. The fact 
that these two last operators are compositions of A and B translates into the Cholesky 

i 

factorizations related to the Darboux transformation C ^ T>. The relation between 
the QR and Cholesky factorizations is simply the matrix realization of the operator 
identities BA = A + U + UA and AB = AUU + A + , which use the fact that U + = U _1 due to 
the unitarity of U. 

From the operator point of view, the main difference between QR and Darboux is 
that QR looks for the unitarity of one of the factors, U, requiring no symmetry for 
the factorized operators, UA and AU, while Darboux needs the self-adjointness of the 
factorized operators, BA and AB. 

We have seen that the QR interpretation of Christoffel transformations on the unit 
circle yields also the Darboux transformations of the corresponding CMV matrices. 
Nevertheless, QR does not provide all the richness given by Darboux, which allows 
more general transformations of measures g ^ v, not necessarily with the form (1621) 
corresponding to £ with zeros outside of the unit circle. In particular, QR can never 
deal with the case of £ having two different zeros on the unit circle, a situation covered 
by Darboux as long as £ is non-negative on the support of g. As for the case of a double 
zero on the unit circle, i.e. (l62jl with |(j = 1, it can always be tackled by Darboux 
but not by QR. When the double zero does not lie in the support of g the situation is 
as good as for Christoffel transformations concerning QR. Otherwise the operator U is 
certainly isometric but not necessarily unitary because ran (U) = L 2 iff 1 jq e L 2 . 

Let us clarify from the operator point of view the situation which distinguishes 
Darboux from QR, i.e. the case of £ having its zeros on the unit circle. Suppose £ 
non-negative on the support of g, so that £{C) is positive definite and Darboux can 
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be implemented. Then, dv = ld[i is such that any function in L 2 lies also in L 2 , 
but the converse is not necessarily true. On the other hand, the supports of /i and v 
may differ only in the zeros of £, where /i may have mass points but v not. Hence, 
£L 2 makes sense as a subset of Lj r All this ensures that A and B are well defined 
bounded operators which are adjoint of each other, but now the first of these factors 
may have a non-trivial kernel. It is not difficult to see that ker(B) = {0}, while ker(A) is 
spanned by the characteristic functions of the zeros of £, which is non-trivial iff /i has a 
mass point at some of these zeros. From general results (see for instance [82, Theorem 
5.39]) we know that BA = A + A and AB = AA + are unitarily equivalent when restricted 
to the orthogonal complement of ker(A + A) = ker(A) and ker(AA + ) = ker(A + ) = {0}, 
respectively. We conclude that BA and AB are isospectral unless fi has a mass point at 
a zero of £, in which case the spectrum of BA is obtained by adding the null eigenvalue 
to the spectrum of AB. Bearing in mind that, using the notation (12UD . BA = f(U M ) 
and AB = f?(LC), this agrees with the fact that the spectrum of U /t differs from that 
of U„ only in the mass points of /i located at the zeros of £. Note that x( z ) + are the 
coordinates of the characteristic function of { 2 } with respect to the orthonormal basis 
X of L^. Therefore, the previous comments show that the square-summable vectors of 
the formal kernel ker r(B) = ker^(A + ) = ker^(A) + are spanned by the evaluations of x 
at the zeros of i. 


7 Darboux, Ablowitz-Ladik and Schur flows 

It is well known that the Lax pair of the Toda lattice is built out of Jacobi matrices. This 
is the origin of a fruitful three-way relationship among integrable Toda chains, Darboux 
transformations of Jacobi matrices and orthogonal polynomials on the real line. On the 
other hand, the unitary analogue of the Toda lattice is given by the Schur flows, a name 
which refers to an integrable system closely related to the orthogonal polynomials on 
the unit circle. These polynomials also have a close link to another integrable system 
of interest in mathematical physics, the Ablowitz-Ladik model. However, no Darboux 
tool was available yet for the analysis of Schur flows or the Ablowitz-Ladik system. In 
this section we will show that such a tool is provided by Darboux transformations of 
CMV matrices, which play for Schur and Ablowitz-Ladik flows a similar role to that 
of Darboux for Jacobi in Toda chains. More precisely, we will see that Darboux for 
CMV constitutes an integrable discretization of such integrable flows, and also provides 
a mechanism to generate new flows from known ones. I 11 order to highlight the analo¬ 
gies between Darboux for Jacobi and CMV regarding integrable systems, we will first 
summarize known connections between Darboux for Jacobi and the Toda lattice. It is 
worth stressing that this section illustrates eloquently the benefits of the interpretation 
of Darboux as a transformation of measures, a central result in this paper which now 
finds a very useful goal: the simplification of the arguments for statements related to 
integrable systems. 

Using Flaschka’s variables, the evolution of a semi-infinite Toda lattice chain can be 
rewritten equivalently as a flow J(t) of Jacobi matrices governed by the Lax equations 
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dt 




A3) = P+3-P-3, 


where P+ are the upper/lower triangular projectors 


(64) 
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The integrability, which relics on the isospectrality of the flow generated by a Lax 
pair, is also obvious when this flow is translated to the orthogonality measure d//(x, t) 
associated with 3(t), namely [48] 

d//(x, t) = eT xt dfi( x i 0)- (65) 

Actually, the relations (1641) and (l65j) are completely equivalent [66] . 

On the other hand, the iteration of a Darboux transformation with polynomial 
p(x) = x + /3 starting with a given Jacobi matrix 3o, 

s~r P rr P rr P 

J0 l_> Jl l_> J2 l_> ' ‘ ‘ , 

leads to a sequence of Jacobi matrices J n . In view of (149j) . this Darboux iteration reads 
as the difference equations 


V‘k,n +1 T x k—l,n+l Uk,n T x k,n j N 

(bo) 

^k,n+l x k ) n+l 'U , k+l,nVk,ni 

for the LU variables Uk , n , v k,n of + /?/, where we use the natural adaptation of the 
notation in (148|) and (149|) . If we think of J n = 3{nS) as a time discretization of a 
differentiable flow J(t) with time-step S, the difference equations (144|) become the Toda 
lattice equations (16-4]) in the continuous limit 5 —> 0 [Hj 175] . A quick argument for this 
assertion follows from the identity 

e~ xt dfi( x i 0) = lim (1 — 5x) n d/i(x, 0), 5 = —, 

n—>cc n 

which identifies the Toda flow as a limit of iterated Darboux transformations. In other 
words, the Darboux transformations of Jacobi matrices constitute an integrable time 
discretization of the Toda lattice. 

Such an integrable discretization is useful to integrate the dynamical system nu¬ 
merically with higher accuracy. Nevertheless, this is not the only interest of Darboux 
transformations for the Toda lattice. For instance, Darboux transformations can be 
used to generate new solutions of the Toda lattice starting from a known one. More 
precisely, if J(t) solves ([64]) . for any polynomial p of degree one, the Darboux transfor¬ 
mation J(t) 44 JC(t) yields a new solution JC(t) of (164]) . The simplest proof of this relies 
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on the interpretation of Toda and Darboux in terms of the measures dfj,(x, t), du(x, t ) as¬ 
sociated with J(t), JC(t). Since a Toda flow J{t) is equivalent to dp(x, t ) = e~ xt dp(x, 0) 
and Darboux reads as du(x,t ) = p(x) dp(x, t), we get dis(x,t) = e~ xt du(x, 0), so JC(t) 
is a Toda flow too. Therefore, the iterative Darboux transformations of a Toda solution 
J{t) given by any sequence p n of polynomials of degree one leads to a hierarchy of Toda 
solutions 

Inverse Darboux transformations can be also used to generate such hierarchies of 
Toda solutions, although this needs a careful choice of the free parameter appearing in 
inverse Darboux for Jacobi. If A Z(t) follows from an inverse Darboux transformation 

JC(t) i-—> J(t) of a Toda flow J(t), the measures dp(x,t), dv(x,t) associated with 
J(t), K(t) are related by 


du(x,t) = + m(t) S c . 

x C 

Then, )C(t) is another Toda flow provided that the mass has the time dependence m[t) = 
m(0) e - ^, so that dp(x,t ) = e~ xt dp(x,0) implies dv(x,t) = e~ xt du(x, 0). On the other 
hand, the freedom in the mass at ( is equivalent to the freedom in the (0, 0) entry of 
the triangular factor A which determines the reversed factorization J — (I = A + A. 
Denoting this entry by r 0 as in the CMV case, from the relation q = Ap between the 
orthonormal polynomials p, q related to J , /C, we find that Tq = Q'o/Po = \dp,/\du. 
Thus, the time dependence of the mass which preserves the Toda solutions is equivalent 
to a time dependent choice of the free parameter ro(t) given by 



J x ^ dp(x, 0) 

$(.(£-*),«££) +m(0 )' 


(67) 


The arbitrariness of m(0) is equivalent to the arbitrariness of ro(0), which therefore 
determines ro(t) at any time t. This means that, once p(x) = x — £ is chosen, the 
inverse Darboux transformations of J{t) which preserve Toda solutions constitute a 
one-parameter family parametrized by m(0), equivalently by the entry ro(0) of A(0) 
which determines the factorization p(JT(0)) = A(0) + Jl(0). The factorizations = 

A(t) + A(t) giving at other times the Toda preserving inverse Darboux transformations 
are determined by (j67J) . 

Therefore, inverse Darboux transformations are more efficient than direct ones in 
generating new Toda solutions since, for each choice of the polynomial driving the 
transformation, they yield a one-parameter family of Toda solutions instead of a single 
one. Hence, given a Toda flow Jit), the iterative inverse Darboux transformations 
defined by a sequence p n of polynomials of degree one leads to a hierarchy of families of 
Toda flows J n {iTii ,..., m n ; t ) depending on an increasing number of parameters, which 
can be interpreted as a mass m n = m n ( 0) at the zero of p n at initial time, 


• • • hi J 2 (mi, m 2 ] t ) hi Ji(m\] t) hi J(t). 
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The above connections between Darboux for Jacobi and the Toda lattice have uni¬ 
tary analogues that have been ignored so far. They involve Darboux for CMV and 
known unitary counterparts of Toda, the so called Schur and Ablowitz-Ladik flows. 
Schur flows are the unit circle version of the Toda lattice, connected to Toda via the 
Szego mapping, while the Ablowitz-Ladik model follows from the space discretization 
of the nonlinear Schrodinger equation [21 (3j [22j EB [32J [52j EU [63j EU [65] [73]. Both 
integrable systems define a dynamics in the space of “Schur sequences” (sequences in 
the open unit disk) via the difference-differential equations 

Schur flows Ablowitz-Ladik 

d&n 2{ \ ■d&n 2t . \ 

= Pni a n +1 _ a ri— l); ~^~dt = dn\ a "i^+^ + a ri— l); 

| On | < ~- 1 j Pn \/1 | Ch n | , TL ^ 1, <20 1. 

L. Golinskii discovered in 2006 [32] that the Schur flows are equivalent to flows C(t) of 
CMV matrices with Schur parameters a n (t) governed by the Lax equations 

= [*(ReC),C], ReC-i(C + C + ). (68) 

where 7r(-) is defined as in (1641) . In [32], this author also proved the equivalence of the 
Schur flows and the evolution 

dp(z, t) = e ( ' z+z 1 - )t dfi(z, 0) (69) 


for the measure dn(z,t) related to C{t) (see [T7j for an analysis of the flow fl69ll starting 
from the Lebesgue measure). These results constitute the unitary analogue of the 
equivalence among the semi-infinite Toda lattice, (1641) and f|65lh 

The equivalence among Schur flows, (l68j) and (l69jl can be easily extended to slightly 
more general flows including Ablowitz-Ladik, which allows for a unified treatment of 
both integrable systems. The transformation a n —> A n a n , |A| = 1, exchanges Schur flows 
and what we will call Schur flows with parameter A, given by 

= p^(Aa n+ i - Aa n _i). (70) 

The Ablowitz-Ladik flows become the Schur flows with parameter A = i. Since the 
transformation a n —> A n a n is equivalent to the rotation d/j,(z,t ) —> dp(Xz,t) of the 
measure, from (1691) we find that (1701) reads as 

dp(z, t) = e ^ z+Jz ~^ d/i(z, 0). (71) 


Besides, from the explicit Schur parametrization (12T1) of CMV matrices we can see that 
~^ A n a n is equivalent to C —* AACA + with A the diagonal unitary matrix 


A 


( 


V 


A 2 
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(72) 


Hence, using (168]) we conclude that (1701) can be rewritten as the Lax equation 

^ = WRe(AC)),C], 

Actually, the equivalence among (17(1 , (ITT]) and (1721) is a particular case of a much more 
general one concerning what is known as generalized Schur flows [73]. Nevertheless, 
Schur flows with parameter A (in short, Schur flows) are enough to understand the role 
of Darboux for CMV in integrable systems. 

The exponential character of the evolution for the measure under Schur flows allows 
us to search for an integrable discretization of such dynamical systems using arguments 
which parallel those of the Toda case. Rewriting 

e (\z+\z-i)t d , z , Q) = lim n + + 5\ z -i\n dfi(z, 0), 8 = -, 

n—* oo 77, 

shows that Darboux for CMV can be understood as an integrable discretization of the 
Schur flows [63]. This means that (170]) should be obtained as a continuous limit of the 
relation (l3Tj) between the Schur parameters of CMV matrices connected by Darboux. To 
check this consider the iteration of a Darboux transformation with Laurent polynomial 
6(z) = 1 + az + az~ 1 starting with a CMV matrix C 0 , 

Cq ^ Ci ► C 2 • • • , 


and suppose that C n = C(n8) come from a time discretization of a differentiable CMV 
flow C(t) with time-step 8 = |a|. Then, (T3TT) yields the following relations among the 
Schur parameters a^n of C n , 


K 


Q j k,n +1 ^k,r, 


k— l,n+l 




(XClk— l,n+l) 


k,n 


plnPln ' 


pL 


(73) 


1 - 2Re(aai, n ) p\ n +ipl, n +i ' ■ ■ Pk-i,n+i 




OLC^k— l,7i+l) ? 


where pk, n = -yA — \cik,n\ 2 i K kn = Pk,n' • • Pk,i\J]dfi n and the measures p n associated 

with each C n are related by dp , n+ 1 = £dp n so that $ dp n+ 1 = (1 — 2Re(ctai in )) $ dp n . The 
difference equations (173]) define explicitly an integrable discretization of the Schur flows 
[63]. Indeed, expressing a = 8 A, |A| = 1, we get 


^/c,n+l ^ k,r 


p\,nPl,n ■ ■ ' P%, 


1 - 2ARe(Aai >n ) p\ 

) n+lP2,n+l 


Pk—l,n+l 


(Attfc+qji Attfc—l,ri-|-l) 5 


which in the limit <5 —» 0 obviously gives a Schur flow with parameter A. 

ft is also possible to generate hierarchies of Schur flows starting from a given one 
C(t) by iterating Darboux transformations of CMV matrices, 


C(t) A Ci(t) A C 2 (t) & ■ ■ ■ . 
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Analogously to the Toda case, this is a simple consequence of the characterization 
of Schur flows and Darboux for CMV in terms of measures. Any Darboux trans¬ 
formation C(t) T>(t) translates into the relation du(z,t) = £(z) d/j,(z,t) for the 
measures dfi(z,t), du(z,t ) of C(t), T>(t), thus d/j,(z,t) = e^ Xz+Xz ^ dfi(z, 0 ) implies 
du(z,t) = e^ Xz+Xz ls>t dv(z, 0 ). 

Inverse Darboux transformations can improve the generation of new Schur flows 
because each iteration of inverse Darboux provides in general a parametric family of 
such flows under a suitable choice of the freedom in the reversed Cholesky factorizations. 
Consider for instance the case £(z) = (z — ( )(z~ 1 — () with |£| = 1. In terms of the 
measures d/j,(z, t), du{z ) t ) of C(t), T>(t), an inverse Darboux transformation V(t) C(t ) 
reads as 

du(z, t ) = + m(t) S ( . 

\z - Cr 

Bearing in mind the characterization (T7Tj) of Schur flows, this transformation maps 
a Schur flow with parameter A into another one exactly when 777 (f) = m{ 0)e 2Re< ' A ^h 
On the other hand, Theorem 14.31 shows that the freedom in the reversed factoriza¬ 
tions £(C) = A + A leading to CMV solutions of inverse Darboux is encoded in the first 
Schur parameter b\ of the inverse transform T>. Since b\ = —\z dv/ ^ dis, we find that 
the inverse Darboux transformations which preserve Schur flows with parameter A are 
characterized by a time dependent choice 


bi(t) = 


^ e 2Re(A(,— C ))* W) +m ( 0 ) C 
^ e 2Re(A(,-C)yfM^ +m ( 0 ) ’ 


(74) 


determined by its initial value &i(0) via the arbitrary parameter m{ 0). Hence, this mass 
parametrizes the inverse Darboux transformations preserving Schur flows. 

Thus, given a sequence l n of Hermitian Laurent polynomials of degree one with a 
double zero, the iterative inverse Darboux transformations of a Schur flow C(t) yield a 
hierarchy of families of Schur flows C n (mi,. . . ,m n \t) parametrized by the initial mass 
m n = m n (0) at the zero of each £ n , 


• • • t-A £ 2 ( 777 . 1 , m 2 ; t) A- Ci(mi; t ) A- C(t). 


In case of £ n with two distinct zeros on the unit circle, the number of parameters 
increases by two at the 77 -th iteration of inverse Darboux, namely, the initial masses at 
both zeros of £ n . 

The above ideas suggest that exploiting the links between Darboux for CMV and 
Schur flows should be as fruitful as for Darboux for Jacobi and Toda lattices, although 
this requires further research beyond the scope of the present work. 


8 Further extensions 

The Darboux transformations of CMV matrices introduced in this paper admit a couple 
of generalizations of interest. One of them has to do with the degree of the Hermitian 
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Laurent polynomial involved in the transformation. The results of the previous sections 
carry over directly to polynomial modifications of higher degree. Indeed, announcing 
this generalization, some of the results previously obtained were proved for Laurent 
polynomials of arbitrary degree (see Section EJZjh 

On the other hand, as we pointed out previously, Darboux transformations of Ja¬ 
cobi matrices can deal also with tridiagonal matrices whose lower and upper diagonals 
are not necessarily positive, but simply non-null. This connects such transformations 
with general quasi-definite functionals on the real line or, in other words, with orthog¬ 
onal polynomials with respect to signed measures supported on the real line. Although 
this generalization seems to be missing in the Cholesky approach to Darboux trans¬ 
formations, the quasi-dehnite case can be tackled resorting to generalized Cholesky 
factorizations AEA + , A e involving diagonal matrices E whose diagonal entries are 
+ 1 (‘sign matrices’ in short). The matrices playing the role of CMV for this general¬ 
ization constitute a wider class of matrices C which can be non-unitary, but are always 
quasi-unitary [83j with respect to a certain sign matrix E, i.e. 

CEC + = E = C + EC. 

We will call quasi-CMV matrices to such a generalization of CMV, given by (J79]). Al¬ 
though not named in this way, quasi-CMV matrices were introduced for the first time by 
some of the authors in the paper HU which rediscovered CMV a decade after Watkins’ 
one (8lj. Actually, nu introduces directly quasi-CMV matrices, from which CMV arises 
as a subclass. 

In this section we will briefly discuss these two generalizations of Darboux transfor¬ 
mations for CMV matrices. 

8.1 Higher degree transformations 

Darboux transformations of CMV matrices can be associated with any Hermitian Lau¬ 
rent polynomial of arbitrary degree. They are defined exactly as in the case of degree 
one. Since Section 12.21 central for the rest of the paper, is developed for an arbitrary 
degree, the generalization to higher degree of the results previously obtained for degree 
one is straightforward. Therefore, the aim of this section is not a detailed discussion of 
higher degree Darboux transformations, but to give an account of those features which 
depend on the degree. 

Let £ be a Hermitian Laurent polynomial of degree d, i.e. with N z = 2d zeros 
counting multiplicity. Given a CMV matrix C, the Cholesky factorization £{C) = AA + , 
A e 3T, exists iff £[C) is positive definite, and then it is unique (see Appendix O. Due 
to the CMV shape Q of C, the Hermitian matrix £(C) is 2 N z + 1-diagonal with non-null 
entries in the upper and lower diagonals, so A is N z + 1-band with non-null entries in 



the lower diagonal, i.e. 
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This defines the higher degree Darboux transformations C i-> T> = A~ L CA of CMV 
matrices. As in the case of degree one, they constitute the matrix translation of the 
mapping u >—» v = u£ between positive definite functionals. Actually, the positive 
definiteness of £(C), which guarantees its Cholesky factorization, is also equivalent to 
the positive definiteness of u£, as well as to the unitarity of X>, which is therefore CMV. 
Besides, the same factor A appearing in the Cholesky factorization of £(C ) gives the 
reversed Cholesky factorization £(T>) = A + A. 

Since every non-constant Hermitian Laurent polynomial factorizes into degree one 
ones, the functional interpretation of higher degree Darboux transformations implies 
their equivalence with compositions of degree one transformations. However, this is not 
the case for higher degree Darboux transformations with parameters, which exhibit a 
higher richness of spurious solutions than mere compositions of degree one transforma¬ 
tions. To show this, consider a Hermitian Laurent polynomial t with N~ zeros, a CMV 
matrix T> and aiV, + 1-band triangular matrix A e 3T. Let us rewrite £{T>) and A as 
N z x A^-block matrices, 


m = 


A = 


( L 0 M+ 

M 0 L\ M+ 

Mi L 2 Ml 

V 


.) 



/© * *...* * \ 


R n = 


Ln = K, 


( + .+ 

* * + 

* * * + 


M n = 


\ * 


* * • • • * * + 


/< 


* * * 
© * * 
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* H« • • • efe * 


s n = 


He H« He 

© * * 

© * 


Note that Rq is the N z x N z leading submatrix of A highlighted in (1751) . and M n , R n , S n 
are non-singular for all n ^ 0. Then, the reversed Cholesky factorization £{T>) = A + A 
reads as 

R+Rn + s+s n = L n , Rn+iS n = M n , n > 0. (76) 

Since S n is non-singular, it has a unique polar decomposition S n = with V n 

unitary and |iS n | the positive definite square root of S+S n . This allows us to rewrite 
(1761) as 

|5 n | 2 = L n - R^R n , \S n \M- 1 = V n + (R:)-\ 

Therefore, R n determines | < S' r! .| via the first equation, while |S' n | fixes V n and R n + 1 , 
provided by the QR factors of |S' ri ,|M" 1 as the second equation shows. This implies that 
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all the blocks of A are inductively generated starting from a choice of Ro, which is only 
restricted by the condition that must be positive de fini te for all n ^ 0. Therefore, 
apart from this constraint, the entries of R 0 can be considered as ‘free’ parameters 
describing all the reversed Cholcsky factorizations of f?(D). In the case N z = 2, using 
the notation fl2T|) we get R 0 = (s°n), which encodes the ‘free’ parameters already 
known for degree one. 

The number of non-trivial entries of is N Z (N Z +1)/2, among them N~ positive (the 
diagonal ones) and the rest are, in general, complex numbers. Therefore, the number 
of ‘free’ real parameters of a Darboux transformation with parameters is N%, where N z 
is the number of zeros of the corresponding Hermitian Laurent polynomial. This in 
contrast with the composition of degree one Darboux transformations with parameters 
associated with d = N z /2 factors of the Hermitian Laurent polynomial, which only leads 
to 4 d = 2 N z ‘free’ real parameters. 

Although the composition of degree one Darboux transformations with parameters 
gives in principle less spurious solutions than higher degree ones, this iterative procedure 
to deal with higher degree inverse Darboux has a drawback because such a composition 
is not feasible in most of the cases. The reason is that every single degree one step 
gives many spurious solutions which are not CMV neither unitary, preventing us from 
composing with a new CMV Darboux transformation. This forces us to take into 
account the CMV conditions (15U|) at any step. Then, the resulting contrained iterative 
procedure should end up with a set of solutions parametrized by d complex numbers 
in the open unit disk, playing the role of the first d Schur parameters of the CMV 
solutions. 

Another reason could increase even more the interest of higher degree Darboux 
transformations with parameters for CMV: The spurious solutions might have their own 
interest if associated with non-standard orthogonal polynomials on the unit circle. There 
is a strong reason to expect an interesting connection with non-standard orthogonality. 
Higher degree Darboux transformations with parameters for Jacobi matrices have been 
already considered in the literature [21 i [28J. Contrary to the case of degree one, they 
present spurious solutions which are related to Sobolev orthogonal polynomials on the 
real line [T8, E]. 

The parametric analysis of spurious solutions for Jacobi follows the same arguments 
given previously, since they work for any matrix which, like £(C), can be considered as 
a block-Jacobi one. Then, if a real polynomial p has N z zeros (coinciding now with the 
degree), the corresponding Darboux transformations with parameters of a Jacobi matrix 
J are described by N Z (N Z + 1)/2 ‘free’ parameters, those encoded in the N z x N z leading 
submatrix of p(j7’). The only quirks of the Jacobi case is that all these parameters are 
now real and that, in contrast to the CMV case, any (not necessarily even) number N z 
of zeros is possible. Actually, the fact that the minimal step for Jacobi and CMV is 
N z = 1 and N z = 2, respectively, is the cause of the possibility of avoiding spurious 
solutions in the Jacobi but not in the CMV case. 

Due to the equivalence with polynomial transformations of measures, higher degree 
Darboux transformations of Jacobi and CMV matrices are both almost isospectral. 
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More precisely, the Darboux and inverse Darboux transformations may delete or add, 
respectively, at most N z eigenvalues. 


8.2 Quasi-CMV matrices 

Quasi-CMV matrices are the matrix representations of the multiplication operator ([4j) 
with respect to zig-zag bases which are ‘orthonormal’ with respect to quasi-definite 
functionals on the unit circle. Such functionals are those Hermitian linear function¬ 
als u on A whose Gram matrix in P n = span{l , z, z 2 ,..., z n j is non-singular for all 
n = 0,1,2,.... (equivalently, the Gram matrix of u in IP with respect to a basis 
p n e P n \P n _i is quasi-definite, i.e. all its leading submatrices are non-singular). This 
property, independent of the basis chosen for the Gram matrix, is equivalent to the ex¬ 
istence of a sequence of polynomials ip n e P n \P n _! which are ‘orthonormal’ with respect 
to w, i.e. u[(fi n (p m *] = e n S n!m with e n e (1, —1}. The polynomials ip n satisfy forward and 
backward recurrence relations which generalize (l22j) . 


Pn<fn(z) = ZCPn-l(z) + (z ), 

Pn zl Pn-l(z) = lf n {z) - a n (fi*(z ), 


Pn | | [j 

Pn _ Pn Sg(l |®n|^)Pri) 

^n—1 


(77) 


where sg(x) = x/\x\, x e R\{0}, is the sign function and the Schur parameters a n are no 
longer restricted to the open unit disk, but they can lie anywhere in the complex plane 
outside of the unit circle. 

The previous extension of orthogonal polynomials on the unit circle goes back at 
least to the works of Geronimus ED- ffowever, the connection of the quasi-definite case 
with ‘orthonormal’ Laurent polynomials and quasi-CMV matrices, which we su m arize 
below, is of much more recent vintage [IT] . 

Since f(z) >-* z~^ n ^f{z) is a one-to-one correspondence between P„ and L n which 
preserves the sesquilinear form (/, g) u[fg*\, the quasi-definiteness of u also means 
that its Gram matrix in L n is non-singular for all n = 0,1, 2,... (equivalently, the Gram 
matrix of u in A with respect to a basis l n e L n \L n _i is quasi-definite). Therefore, 
analogously to the previous discussion, the quasi-definiteness of u is equivalent to the 
existence of a zig-zag basis y which is ‘orthonormal’ with respect to u, i.e. u\xnXm*\ = 
e ' n fin,m, e' n e {1,-1}. Actually, as in the positive definite case, the relation between 
the orthonormal polynomials Lp n and the orthonormal Laurent polynomials Xn is given 
by (PH) . This implies that e' n = e n , so that the orthonormality condition for y can be 
rewritten as ?/[yy + ] = E, where E denotes the sign matrix 


/ e° \ 

ei 

e 2 

V ■■■/ 


e 0 = sg(u[l]), 

n 

e„ = e 0 nsg(l-K| 2 ), n>l. 

k= 1 


(78) 
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It follows from (124)) and (1771) 

that zx 

= C\, where 
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This matrix, which is the representation of the multiplication operator with respect to 
X, has a unique inverse, the matrix representation of z with respect to We will refer 
to C as a quasi-CMV matrix due to its quasi-unitarity with respect to the sign matrix 
E given in (1751) . Since +u have the same orthonormal zig-zag basis, the quasi-definite 
functional related to a quasi-CMV matrix is unique up to a non-null real rescaling. 

A drawback of Darboux transformations for a quasi-CMV matrix C is the possible 
lack of hcrmiticity of £(C) for a Hermitian Laurent polynomial £(z) = az + f3 + Txz -1 . 
This is overcome by the quasi-unit arity of C, which implies that C _1 = EC + E. As a 
consequence, we get two Hermitian matrices, 


£(C)E = aCE + /3E + aEC^ 


E£(C) = a EC + (3E + aC + E , 


which will share in the quasi-CMV case the role played by £(C) for CMV matrices. The 
hermiticity of £(C)E or £(C)E characterizes the sign matrix E which makes C quasi¬ 
unitary, a result proved in the following lemma, which also shows that such a sign matrix 
is determined by C up to a sign. 

Lemma 8.1. Let C be a quasi-CMV matrix, E a sign matrix and £ a Hermitian Laurent 
polynomial of degree one. Then, the following statements are equivalent: 

(i) C is quasi-unitary with respect to E. 

(ii) £(C)E is Hermitian. 

(Hi) E£(C ) is Hermitian. 

(iv) E = ±E, where E is given by (178)) . 

Proof. We already know that ( i ) =^> (ii), (i) => (Hi) and (iv) =^> (i). On the other 
hand, (ii) <s> (Hi) obviously because E£(C) = E(£(C)E)E. Hence, it only remains to 
see that (ii) =^> (iv). Let us write £(C)E = ME, where M = £(C)E is a five-diagonal 
matrix with the shape (128|) and E = EE is a sign matrix. Assuming (ii) means that 
ME is Hermitian, which is equivalent to state that ME = EM because M and E are 
also Hermitian. It is easy to see that, due to the structure ()28l) of non-null matrix 
coefficients of M, the only sign matrices which commute with it are +1. We conclude 
that E = +/, which proves (iv). □ 
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If u is a quasi-definite functional associated with the quasi-CMV matrix C and E is 
the corresponding sign matrix, then we get 

u£[xx + ] = u[?xx + ] = £(C)u[xx + ] = £{C)E. 

As a consequence, we obtain the quasi-definite version of Corollary 13.31 u£ is quasi- 
definite iff £(C)E is quasi-definite. The last condition is also equivalent to the existence 
of a generalized Cholesky factorization (see Appendix [All 

£{C)E = AFA + , A e tA, Fey = set of infinite sign matrices, (80) 

which is also unique. Since £(C) has the five-diagonal structure (fl4jl . as in the CMV 
case, A has the 3-band shape (fT5j) . The factorization (l80l) is the starting point for the 
Darboux transformations of quasi-CMV matrices. 

To introduce such transformations we just follow the same steps as in the case of 
CMV matrices. From Proposition 12.101 the zig-zag basis u given by x = -Au anc l th e 
matrix B defined by £uj = Bx yield the factorization £(C) = AB. Comparing this with 
f[80jh taking care of the associativity with Proposition 12.31 and bearing in mind that 
ker^(A) = {0}, we conclude that B is the 3-band upper triangular matrix 

B = FA + E. (81) 

Using again Proposition 12.101 we also find that T> = A~ 1 CA is the matrix of the multi¬ 
plication operator with respect to u> and satisfies 

F£(V) = A + EA, dey, FeY, (82) 

which is a generalized reversed Cholesky factorization. Besides, the associativity prop¬ 
erties of Proposition 12.31 and (18TT) yield 

u£[ukj + ] = u[£uu> + ] = -Bu[xx + ](A + ) -1 = F. 

This identity has several consequences: The zig-zag basis u is orthonormal with respect 
to the quasi-definite functional v = u£, thus P is a quasi-CMV matrix and F is the 
corresponding sign matrix. 

Sumarizing, given a quasi-CMV matrix C with sign matrix E and a Hermitian 
Laurent polynomial £ such that £(C)E is quasi-definite, we have found a transformation 
C 1 —> — A~ V CA which generates a new quasi-CMV matrix T>. The ingredients of this 
transformation, i.e. the lower triangular matrix A and the sign matrix F of T>, are given 
by the generalized Cholesky factorization (1801) of £{C)E. The matrix F£(T>) satisfies 
a generalized reversed Cholesky factorization obtained from that of £{C)E by changing 
A —> A + and F > E. This is what we call the Darboux transformation for quasi-CMV 
matrices. In perfect agreement with the CMV case, this transformation is the matrix 

i 

translation of the transformation u i—> v = u£ connecting quasi-definite functionals. 

Concerning inverse Darboux transformations, starting from a quasi-CMV matrix T> 
and its sign matrix F, the matrix transformation C = ADA -1 , with A and E given by 
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(182]) . defines the Darboux transformations with parameters for quasi-CMV matrices. 
This matrix procedure solves the inverse problem, which consists in finding all the 
quasi-CMV matrices C such that C >—► V, but also yields spurious solutions C which are 
neither quasi-CMV nor quasi-unit ary. Analogously to the CMV case, the solutions of the 
inverse Darboux transformations coincide with the quasi-CMV solutions of the Darboux 
transformations with parameters. To prove this note that, if c o is an orthonormal zig-zag 
basis for V and x = Au then, from (l82]i and Proposition (273J lu = £(V)uj = FA + EAlo = 
Bx with B given by (IHTT) . In view of Proposition 12.101 we find that V = A~ 1 CA with 
£(C) = AB = AFA + E. Since £(C)E = AFA + is Hermitian, Lemma 18.11 proves that, 

whenever C is quasi-CMV, E is its sign matrix up to a sign. Therefore, C i-> T> for such 
a quasi-CMV matrix C. 

The theory of Darboux transformations for quasi-CMV matrices can be developed 
just by mimicking the steps and arguments given for the CMV case. All the results 
obtained in the previous sections have a natural generalization to the quasi-CMV case, 
the details are left to the reader. We will simply mention some specific results of interest 
for the example discussed below. 

Assuming the notation (1271) for the matrix coefficients of the factor A and denoting 
by f n the diagonal entries of the sign matrix F , the factorization (15UD reads as 

fn— 2 ^ 71 — 2 I T fn —11 ^n —11 T fnJ' n ( P 2Re(ctCl n (Z n _|_x)) , 

< fn—l^n—An—1 fnl n$n ^n+lPn+1 OM n + 2 )) 71 ^ 0. (83) 

fnTnt'n ®®n+2Pn+lPri+2 j 

These equations, together with the initial conditions t _2 = t- 1 = s_i = 0, make 
explicit the Darboux transformations for quasi-CMV matrices. Besides, generalizing 
the identities in (l29j) . (130|) and (ITTll we get the following direct relations between the 
matrix coefficients of A and the Schur parameters of the quasi-CMV matrices involved 
in a Darboux transformation, 


tin 


^n—1 Pn t"n —1 

Tn tn Pn+2 


7 _ —7 r T*n+ 1 

On $n— 1 3 C^^n+2 &O n &n+lJn$n • 

Tn ^n+1 


(84) 


As for the Darboux transformations with parameters, inserting (1271) into the factor¬ 
ization (1821) shows that they are explicitly given by 

T Di+l|^n| V 6 T j._|_2 I| fn (/^ 2Re(cffi n f) n _|_i)) , 

< € n +±S n V n - 1-1 T &n+2tnSn+l fn&n+ l(o , ^n (^n+f) i 71 ^ 0, (85) 

^n+2^nDi+2 ^fn^n+ ]Vn+2j 

where b n are the Schur parameters of V and a n = ^\1 — |6„| 2 |. These relations de¬ 
termine e n , r n , s n , t n once eo, r 0 , so, ei, D are chosen, as follows from rewriting (|85ll 
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as 


/ \ || ^n+l^n+2 > ^ £ 

^n+2 = SS(^n)? T n+2 ^ j 1 \ &n+2 Jn^n i 

r n \a\ 

OL d n -\-\{oib n Q^n+2) ^n+l./n^n^’n+l 

^n +1 = "j 7 j 

| | T n 

7"n V |£71 1 5 fn (/^ 2R,6(o(& ri & 77 > _|_i)^ C n T n 6 n _|_x|^n| 


(86) 


Therefore, the solutions of the Darboux transformations with parameters can be parametrized 
by e 0 rg, s 0 , e\ r 2 , which give again four real ‘free’ parameters. The existence of such 
a solution for a given value of these free parameters is equivalent to state that e n A 0 
for all n ^ 0. Among these solutions, the quasi-CMV ones are characterized by a 
generalization of the CMV conditions (l39]h namely, 


eo/cTo = P~ 2Re(aa), 

, . so 

a = b\ -\ - a i, 

r-i 


<7\ 

n = r 0 —, 
p 

ei = e 0 sg(l - |a| 2 ), 


|a| A 1 , 

__ (87) 

p = VI 1 - l a l 2 l- 


The conditions in the second column of (187)1 can be combined into ei/ir 2 (l — |a| 2 ) = 
e o/o r o(l — |&i| 2 ). Like in the CMV case, a is the first Schur parameter aq of the cor¬ 
responding quasi-CMV solution, while the pair of equalities in the first column of the 
quasi-CMV conditions flSTD select the solutions associated with (not necessarily quasi- 
definite) Hermitian functionals. 


Example 8.2. Darboux transformation with parameters for a general Hermitian Lau¬ 
rent polynomial of degree one £(z) = az + ft + az^ 1 and V = S the shift matrix given 

in OSD- 


We will illustrate with this example the coexistence of CMV, quasi-CMV and spuri¬ 
ous solutions in the Darboux transformations with parameters. The shift matrix T> = S 
is a CMV with Schur parameters b n = 0 (so, <r n = 1) for n ^ 1, thus F = I is a sign 
matrix for S. We will search for solutions C of the corresponding Darboux transfor¬ 
mation with parameters, considered in the generalized sense described in this section. 
This allows the appearance of quasi-CMV solutions C with a non-trivial sign matrix 
E. Without loss of generality we can take |ck| = 1, which simply means that we avoid 
a trivial positive rescaling of the factor A in the reversed factorization £(S) = A + EA, 
where 


*(S) 


/ p cx. a 

a (3 0 a 

a 0 /3 0 a 

a 0 ft 0 a 

a 0 /3 0 a 


( 88 ) 


The diagonal entries e n of the sign matrix E, as well as the coefficients r n , s n , 
t n giving and the three diagonals of A, are determined by (1861) . In what follows we 
will restrict our attention to the subset of solutions with si = 0, a condition which 
becomes a constraint between the parameters e 0 rg, s 0 , eir 2 describing the whole set of 
solutions. Therefore, the referred subset of solutions can be parametrized by e^r^ and 
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eirf, although instead we will use e\r\ and for convenience. The corresponding 
parametrization of A and E, as follows from (186|) . is given by 


_ e i 2 

So — a. —, eo?’ 0 

n 

Sn = 0, e n+ 2^ n+ 2 


(3 - ei|s 0 | 

1 

P - e n r 2 n ’ 


1 

e 2 r| 



tn— 1 


Gn+1 

r n +1 


1 

2 > 
e 2 r| 

n ^ 1. 


These equations are governed by a single recurrence relation 


(89) 


^n+1 


1 

(3 - x n ’ 


X r 


e 2n+l r 2n+l’ 
\ or 

^ e 2n+2 r 2n+2i 


n ^ 0, 


(90) 


which provides separately the odd and even coefficients r n , t n depending whether we 
use the initial condition Xq = e\r\ or Xq = e 2 rl- The initial conditions generating true 
solutions of the Darboux transformation with parameters are those giving x n (3 for 
n ^ 0 and guaranteeing that e 0 rQ ^ 0, i.e. 


1 , 1 

2 2 

e\ r{ e 2 r | 


^ f3. 


(91) 


Recurrence (1901) yields the continued fraction expansion 


1 I 1 I 1 I 1 I 1 | 1 ! 

vp i p - ^ ■ 

Using basic results from the theory of continued fractions ra, we obtain 

Un_l(§)-X 0 t4-2(f) 
x n = -^- 7r~, n ^ 1, 

C/ n (|)-x 0 £/„-i(f) 

where U n are the second kind Chebyshev polynomials, given by 


U n +i(t) — 2tU n (t) — U n _i(t), U- 1 — 0, Uo — 1- (92) 

Therefore, if Si = 0, the reversed Cholesky factorization 1(C) = A + EA is possible 
exactly for those choices of e\r\ and e 2 r% satisfying (j9Tj) and 


2 2 / 
ei r u e 2 r 2 A 



n ^ 1. 


(93) 


The solutions associated with a Hermitian functional are characterized by the two 
equalities in the first column of (|H7|) . which in this case become 


e o r o 


P~ 


2 

ei r{' 


(94) 


66 
















Using the expression for r$ given in (l89lh the above hcrmiticity condition turns out to 
be e\ = e 2 and r\ = r^. Therefore, bearing in mind (l9Tf and (l93]h the solutions leading 
to Hcrmitian functionals are those generated by (19Uj) with initial conditions satisfying 


2 ^ , V.(f) 

13 ° 


ei r, = x 0 = e 2 rf, n> 1. 


(95) 


The quasi-CMV solutions are those satisfying the previous hermiticity conditions 
together with those summarized in the second and third column of (l8Tjh In our case, 
these additional conditions read as 


\a\ ¥= 1, £o(l - \a\ 2 ) = eo^o, a = —, 

x 0 

which, using (l94|l . become 

7o| # 1, %o = -z ——■ 

P - xo 

This means that the quasi-CMV solutions are associated with those fixed points x n = x 0 
of (190]) lying outside of the unit circle. Since s n = 0 for n ^ 1, we find from (184|) that 
the Schur parameters of these quasi-CMV solutions are a± = a and a n = 0 for n ^ 2. 
In other words, all the quasi-CMV solutions with si = 0 is are Bernstein-Szego type. 

The fixed points x n = x 0 of (]90l) are x 0 = f + -\J (f) L — 1, which are inverse of 
each other, while the requirement |xo| ^ 1 translates into the inequality |/3| > 2. It is 
convenient to rewrite these two fixed points as 

4 = sg(0) ^|f | ± V(I) 2 “ l) > ( 96 ) 


so that |xq| = |f| + (f) 2 — 1 for \/3\ > 2. Then, |xq| > 1 and |x 0 | < 1. Both 

fixed points satisfy the hermiticity conditions (195|) . thus they give rise to two different 
quasi-CMV solutions characterized by the Schur parameters (a+, 0, 0, 0,...), with a+ = 
a/x q = ax£. Since |a+| < 1 and |a_| > 1, the Bernstein-Szego solution related to the 
initial condition Xq = Xq is positive dehnite, while that one corresponding to x 0 = Xq is 
quasi-dehnite but not positive dehnite. The related CMV matrix, C + , and quasi-CMV 
matrix, C_, have the form 


C± 


( -°± p± 

0 0 0 
+p+ a+ 0 
0 
1 


V 


1 

0 

0 0 1 
0 0 0 
0 0 0 1 
10 0 0 


\ 


7 


a+ = axQ , 

P± = VI 1 - |a+l 2 l = Vl 1 - 7o) 2 
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From the previous results we can obtain the explicit expressions of the factors A+ and 
E+ for both solutions, given by 


A± 


E ± 


( T - \ 

t± r± 

t± 0 r± 


t± 0 r- j- 
t± 0 r+ 


V 

Sg(£) 


t± 0 r± 


( ±l 

1 

1 


J 

\ 


V 




t± = sg((3) ar T . 


The CMV solution C + has a sign matrix E + = sg (j3)I which is —I for f3 < 0, which is 
in agreement with the indetermination of the global sign for E + . This means that we 
can take E + = I for C+, although, in that case, preserving the factorizations involved 
in the Darboux transformation would require to choose the sign matrix F = sg ((3)1 for 
the shift matrix T> = S. 

Summarizing, we have found that quasi-CMV solutions for the inverse Darboux 
problem with si = 0 exist iff \(3\ > 2. Under these conditions, we have a pair of such 
solutions, both Bernstein-Szego type, one of them CMV and the other one strictly quasi- 
CMV. Since the shift matrix S is associated with the Lebesgue measure dd(e lG ) = d0/2n 
on the unit circle, the CMV solution is that one of the Bernstein-Szego measure dd/£, 
and the sign of E + has to do with the fact £ is positive or negative on the unit circle 
depending whether (3 > 2 or (3 < —2. Bearing in mind that £ has roots £+ = — a+ 
for \(3\ > 2, every Hcrmitian solution u of u£ = did is in that case given by u = 
did/£ + m8^ + +m5^_ for some complex mass m. Thus, the quasi-CMV solution must 
be related to such a functional u for a value of m s C which makes it quasi-dehnite. 

If Si = 0, no quasi-CMV solution appears when |/?| ^ 2, which corresponds to l 
having its roots on the unit circle. This is expected when searching for CMV solutions 
because no positive measure on the unit circle satisfies id\i = d£) in that case. Nev¬ 
ertheless, it is a less trivial result about quasi-CMV solutions of inverse Darboux and 
quasi-dehnite solutions u of u£ = did. 

So far, we have completed the search for quasi-CMV matrices in the subset of solu¬ 
tions with si = 0. Now we will have a look at some spurious solutions of this subset. 
The simplest ones should be those obtained by fixing e\r\ = r\ = xq as a fixed point 
of (1901) for |/31 ^ 2. These fixed points lie on the unit circle, so they are candidates to 
generate spurious solutions only when x 0 = +1, which corresponds to [3 = +2. However, 
choosing e\r\ = e ?= +1 for (3 = +2 gives no solution of the Darboux transformation 
with parameters because, using ([55]) . we find that = 0. Thus, no spurious solution 
with si = 0 has an eventually constant sequence e n r 

We can reinterpret the previous results by stating that, in the case Si = 0, the 
quasi-CMV solutions are those whose factors A and E have constant diagonals up to 
finitely many entries. Therefore, the solutions given by factors A and E with eventually 







periodic diagonals are spurious whenever the period is greater than one. These are the 
simplest spurious solutions, which are generated by the periodic solutions of fj90]) . 

The periodic solutions of (jUUj) with period j are characterized by the relation 

X ° - XoU^d) ' 

Using dS2|) , this equation can be rewritten as 

(xg + /3x 0 + 1) Uj- 1 (|) = 0, 

which means that either Xq + /3x 0 + 1 = 0 or Uj_ i(f) = 0. 

The zeros of Xq + /3x 0 + 1 are the fixed points x 0 = Xq given in and leading to 
the quasi-CMV solutions previously found. Therefore, if f/j(|) ^ 0 for every j ^ 1, the 
spurious solutions we are interested in may arise only from two alternatives: e\r{ = Xg , 
e 2 r\ = Xg or e±rl = Xg , e 2 T 2 = Xg . However, (|89l) shows that both options lead to 
eoXg = (3—x o —Xg = 0, thus none of them yield a solution of the Darboux transformation 
with parameters. 

Therefore, the kind of spurious solutions we are looking for only appear when j3 = (3j^ 
is a zero of i(f), 


Pj,k = 2 cos ( j7r ) , k = 1, 2,..., j - 1. 


Actually, for these values of [3 all the solutions are spurious because quasi-CMV solutions 
require \(3\ > 2. 

Consider the simplest case j = 2, so that /3 = 0. Then, (1UU|) generates a 2-periodic 
sequence x 0 , — Xq 1 , x 0 , — Xq ^ ... for every initial value x 0 A 0. Hence, if Si = 0, the 
solutions of the Darboux transformation with parameters whose factors A and E have 
eventually 4-periodic diagonals appear for j3 = 0. The parameters eir^,e 2 r 2 e K.\{0} 
describing these solutions are only restricted by (l9T]h which now reads as eirf + e 2 r| ^ 
0. Although these solutions are necessarily spurious, the associated functionals are 
Hermitian when eir^ = 62 ^, as required by In the Hermitian case, A and E, 

obtained from (189|) . are given in terms of e = e\ = e 2 and r = r\ = r 2 by 


A = 


( r 0 
to r 
to 0 r 

t\ 0 1/r 
t\ 0 1/r 
to 0 r 
to 0 r 


/ 


-1 


E = e 


V 


-1 


-1 


r 0 = 


V2 


r 


a 

to = e —, 
r 


ti = —ear. 


(97) 
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The corresponding spurious solution has the form 


C = 


_ a 

~ e ~ T 7 

A2 

7T 

0 

0 

0 

0 

0 

0 

0 

0 

cf 2 A 

V2 

eaA 

0 

r 2 

0 

0 

0 

0 

0 

0 

V 

V2 

e “ 
r A 

0 

0 

0 

0 

0 

0 

0 

0 

a 3 A 

6 V2 

—a 2 A 

0 

— eaA 

0 

i 

-pi 

0 

0 

0 

0 

aS 

0 

1 

-pi 

0 

0 

0 

0 

0 

0 

0 

CK 4 A 

-ea 3 A 

0 

—a 2 A 

0 

eaA 

0 

r 2 

0 

0 

a 2 A 

V2 

0 

eaA 

0 

r 2 

0 

0 

0 

0 

0 

a 5 A 

a 4 A 

0 

ea 3 A 

0 

—a 2 A 

0 

—eaA 

0 

l 

pi 

a 3 A 

V2 

0 

— a 2 A 

0 

—eaA 

0 

1 

-pi 

0 

0 

0 




A = r 2 + r 2 , 
V = r 2 — r -2 . 


9 Conclusions and Outlook 


We have developed a theory of Darboux transformations for CMV matrices, the unitary 
analogue of Jacobi matrices. They share many properties with the Darboux transfor¬ 
mations of Jacobi matrices, among them the equivalence with (Laurent) polynomial 
modifications of the underlying measures and, as a consequence, the almost isospectral- 
ity. Indeed, the Szego connection between orthogonal polynomials on the real line and 
the unit circle identifies the Darboux transformations of Jacobi matrices as the Szego 
projection of the Darboux transformations of CMV matrices introduced in this paper. 

Nevertheless, the fact that Darboux for CMV has to deal with unitary instead of 
Hermitian matrices causes some dissimilarities with the Jacobi case which are worth 
highlighting. In particular, the unitarity forces us to use Hermitian Laurent polynomials 
instead of simply real polynomials, so that, in contrast with the Jacobi case, the minimal 
step of Darboux for CMV involves polynomials with two zeros. These facts cause 
a number of differences between Darboux for Jacobi and CMV which we summarize 
below, together with the open problems that they suggest: 

• Darboux transformations of Jacobi matrices are based on a factorization of a 
matrix which retains the hermiticity of the Jacobi one. However, the Darboux 
transformations of CMV matrices introduced in this paper need the factorization 
of matrices which do not share the unitarity of CMV, but are also Hermitian. 
The reason for this is that the method we have used to obtain a unitary version of 
Darboux is to relate a unitary matrix with a Hermitian one to which we apply the 
standard Darboux procedure for self-adjoint operators. Is it possible to develop a 
new theory of Darboux transformations for unitary operators without abandoning 
unitarity (i.e. based on splittings of unitary operators built out of the previous 
ones)? 

• Darboux transformations of Jacobi matrices are usually implemented via LU fac¬ 
torizations of non-symmetric tridiagonal matrices associated with monic instead 
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of orthonormal polynomials. This allows for the unified treatment of the gen¬ 
eral quasi-dehnite case. Nevertheless, in the positive definite case they can be 
formulated in terms of Cholesky factorizations. This opens a way for the CMV 
generalization of Darboux transformations as defined in this paper, while Sec¬ 
tion [82] shows that generalized Cholesky factorizations permit their extension to 
the quasi-dehnite case. However, a question remains: Is there any CMV ana¬ 
logue of the LU version for the Darboux transformations (i.e. based on matrix 
representations with respect “monic” zig-zag bases)? 

• Darboux transformations for Jacobi matrices can be implemented via Cholesky 
factorizations and reversed ones or, equivalently, using the Cholesky factor as a 
change of basis relating Jacobi matrices by conjugation. While the latter method 
is directly generalizable to CMV matrices, the first one seems hard to tackle in the 
CMV case due to the drawbacks in recovering an infinite matrix from its image 
by an Hermitian Laurent polynomial. This leads to the following question: Given 
a Hermitian Laurent polynomial of degree one, is there any simple procedure to 
recover a CMV matrix from the result of evaluating the Laurent polynomial on 
such a matrix? 

• In the Jacobi case, the Darboux transformations with parameters can be identi¬ 
fied with the inverse Darboux transformations, which correspond exactly to the 
Geronimus transformations of measures on the real line. The Geronimus transfor¬ 
mations of measures on the unit circle are a particular case of dividing a measure 
by an Hermitian Laurent polynomial and adding Dirac deltas at their zeros. These 
more general transformations of measures on the unit circle correspond exactly 
to the inverse Darboux transformations of CMV matrices, whose matrix realiza¬ 
tion is given by the Darboux transformations with parameters. However, these 
latter transformations present spurious solutions which are not CMV neither uni¬ 
tary matrices, and can be eventually associated with non-Hermitian functionals. 
Is there any interesting interpretation of these spurious solutions? Can they be 
understood in terms of orthonormal Laurent polynomials with respect to non¬ 
standard or matrix inner products? 

A positive answer to the last item is suggested by the recently found connection 
between higher degree Darboux transformations of Jacobi matrices and Sobolev or¬ 
thogonal polynomials on the real line [TB1 [T9 ]. A similar connection for the CMV case 
should be the starting point for a new approach to non-standard orthogonality on the 
unit circle. Other promising lines of future research are indicated by the success of 
Darboux techniques for Jacobi matrices in problems of numerical linear algebra, the 
study of integrable systems or the search for bispectral situations. The recent connec¬ 
tion found between CMV matrices and the so called quantum walks [9] also suggests a 
possible use of Darboux for CMV in the study of discrete quantum dynamical systems. 
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Appendix A Cholesky factorizations 


Given an infinite Hermitian complex matrix M, the existence of a generalized Cholesky 
factorization (GCF) 

M = AEA + , E e y, (98) 

is equivalent to state that M is quasi-definite, i.e. all its leading submatrices are non¬ 
singular. Obviously the condition is necessary because, denoting by the subindex n the 
leading submatrix of order n, (I98p implies that M n = A n E n A+. The sufficiency of the 
condition follows by an inductive reasoning on the order of the leading submatrices of 
M. More precisely, if M n = A n E n A+ is the GCF of the non-singular leading submatrix 
M n and 


A n +1 

then 


A n 0 

x n 


m _ ,'m„ y+ 

M ” +1 ” 1 Y n y n 


, X n 6 C n , x n > 0; 


Y n e C n , y n e M, 
E n 0 


E n +\ 


0 


> ^ {1 ? 1 }) 


M, 


n +1 


A n +1 -En+1 -d n _|_ i 


A E X + = Y + 

/ 1 n 1 J n A ri 1 n ) 

X n E n X n + 6 n X n Vm 


X + = E A~ X Y + 


en^n = Vr 


Y M~ X Y + 

1 n lvl n 1 n • 


These equations have solutions X n e C n , e n e (1, —1}, > 0, whenever M n+1 is 

non-singular because, then, y n ^ YnM^Y^ since otherwise (— YnM^ 1 , l)M n+ i = 0. 
This proves that the quasi-dehniteness of M guarantees the existence of the GCF of 
its leading submatrices, obtained by enlarging those of the smaller leading submatrices. 
This fact is key to ensure the existence of the factorization (j98|) for the infinite matrix 
M because we can take A and E as the only infinite matrices whose leading submatrices 
of order n are respectively A n and E n for all n. 

GCF of quasi-definite Hermitian infinite matrices M involve only admissible prod¬ 
ucts (in the sense introduced in Section [2]), but the reversed ones 


M = A + EA , Ee/, 


(99) 


can be nonsense due to the presence of non-admissible products. A class of infinite 
matrices for which the existence of reversed GCF is a well posed problem concerning 
admissible products, are the band matrices, which lead to band triangular factors A. 

Nevertheless, even for finite matrices, quasi-dehniteness is not necessary neither suffi¬ 
cient for the existence of reversed GCF. For instance, (} J) has no reversed factorization 
despite its quasi-dehniteness, while (? }) = (o 1) ("o 1 i) ( i i ) ha reversed factorization 
of a non-quasi-definite matrix. Actually, the existence of a reversed GCF of a finite ma¬ 
trix M is equivalent to the non-singularity of all the principal submatrices which are a 
south-east corner of M (in what follows we will say that M is ‘reversed’ quasi-dehnite). 
This follows from similar inductive arguments to those given previously for GCF. 
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The problem of the reversed quasi-definite condition is that it is nonsense for infinite 
matrices. The core of the difficulties in characterizing the existence of reversed GCF for 
infinite matrices is that, in contrast to the non-reversed ones, they are not necessarily 
obtained by enlarging those of the leading submatrices because (jU9]l does not imply 
M n = AfE n A n . This argument also shows that the relation between the reversed GCF 
of the leading submatrices is non-trivial. Indeed, an infinite Hermitian matrix having 
a reversed GCF may have no such a reversed factorization for any leading submatrix. 
This is illustrated by 
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where no leading submatrix is reversed quasi-definite due to its null last diagonal entry. 
Another example is (JHHj) for P = 0, which has reversed GCF given by (1971) . 

These drawbacks of the reversed GCF dissapear when restricting the attention to 
standard Cholesky factorizations (CF), i.e. E = I. The same kind of arguments given 
at the beginning of this section prove that the existence of a CF, 


M = AA + , 


for an infinite Hermitian matrix M is equivalent to the positivity of its leading principal 
minors. This condition also characterizes those Hermitian matrices M which are positive 
definite, i.e. XMX + > 0 for every finitely non-null row vector X ^ 0. Besides, positive 
definiteness is necessary for the existence of a reversed CF, 

M = A + A, 

because XMX + = XA + AX + = ||AA + j| 2 > 0 for every finitely non-null vector X since 
ker i? (A) = {0}. 

In the case of finite matrices, positive definiteness is also characterized by the pos¬ 
itivity of the principal minors corresponding to all the south-east corner submatrices. 
As a consequence, the existence of CF and reversed CF takes place simultaneously and 
is equivalent to positive definiteness. However, establishing a similar result for infinite 
matrices requires more technical hypothesis and arguments to deal with the non-trivial 
relation between the reversed factorizations of an infinite matrix and its leading sub- 
matrices. This is accounted for in the following proposition, which also summarizes the 
previous results for convenience. 

Proposition A.l. If M is an infinite Hermitian complex matrix, then 

M has a GCF <=> M is quasi-definite 
M has a CF o M is positive definite M has a reversed CF 

If, besides, M is band matrix with bounded entries and its lower and upper diagonals 
have only non-null entries, then 


M has a reversed CF <=> M is positive definite 


73 








Proof. It only remains to prove the left-handed implication of the last statement. The 
positive definiteness of M is equivalent to the existence of a reversed CF, M n = AfA n , 
for every leading submatrix M n of M. The subindex n in A n only refers to the depen¬ 
dence on the order of the leading sub matrix M n . In other words, we cannot assume 
that all the factors A n are leading submatrices of the same infinite matrix A because 
Ak is not necessarily a leading submatrix of A n for k < n. Our strategy consists in 
proving that, despite the non-trivial relation between the different matrix factors A n , a 
subsequence of them “converges” in some sense to an infinite matrix A which provides 
a reversed CF, M = A + A. 

Bearing in mind that M is a band matrix, the boundedness of its entries is equivalent 
to state that it represents a bounded operator X MX in the Hilbert space Sj of 
square-summable sequences. If P n is the orthogonal projection of f) onto the subspace 
f) n spanned by the first n canonical vectors of fj, then 

Mn : = P n MPn = M n © / = i+i n , A n := A n © /, 

where M n and A n act on Sj n , while the identity acts on its orthogonal subspace. Since 
||A n || 2 = 1 M n | ^ ||M||, there exists a subsequence of A n weakly converging to some 
bounded operator A on Sj [53j Theorem 5.70]. The proposition is proved if we show 
that M = A + A and A s 7T, where, as in the case of M, we identify A with its matrix 
representation in the canonical basis of Sj. 

Suppose that M is 2N + 1-diagonal. Then, each M n is also 2N + 1-diagonal, hence A n 
is N + 1-band lower triangular with positive diagonal entries. Since weak convergence 
implies convergence of the matrix coefficients, we conclude that A is IV + 1-band lower 
triangular with non-negative diagonal entries. To prove that As A? we only need to see 
that its diagonal entries are non-null. This follows directly from the identity M = A + A, 
which we will prove below, and the hypothesis that the lower and upper diagonals of 
M have no null entries. 

As for the identity M = A + A, note first that M n+ ^X = MX for every X e S) n 
because M is 2N +1-diagonal. In consequence, M n is strongly convergent to M because 

| (M n+ N - M)x I = II (M n+N M)(X - P n X) I < 2||M||||X - P n X || 
and ||X — P n X || —» 0 for every X s S). 

Let us denote also by A n the subsequence weakly converging to A, no misunder¬ 
standing will appear from this abuse of language. In contrast with strong convergence, 
weak limits do not commute with compositions of operators, so we cannot conclude 
directly that M = A + A just taking weak limits in M n = AfA n . Luckily enough, the 
weak convergence of A n implies its strong convergence due to the N + 1-band structure 
of A n and the uniform bound ||A n || ^ ||M|| (see Lemma IA.2I below!. Bearing in mind 
that strong convergence preserves compositions [H], taking strong limits in M n = AfA n 
we finally obtain M = A + A. □ 

The proof of the previous proposition makes use of the following lemma. 
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Lemma A.2. Let B n be a sequence of infinite band matrices with uniformly bounded 
entries and a uniformly bounded size of the band. Then, if B n is weakly convergent, it 
is also strongly convergent. 

Proof. First, ||5 n || is bounded since B n is uniformly banded with uniformly bounded 
entries. Therefore, the weak limit B of B n is also bounded. Since the entries of B are 
the limits of the entries of B n , the weak limit B inherits the band structure of B n . Due 
to this band structure, if X e Sj is finitely non-null, (B n — B)X only involves a finite n- 
independent number of matrix entries of both, B n and B. Therefore, \\(B n — B)X\\ —> 0 
for every finitely non-null vector X. Given an arbitrary vector X e h, consider the 
inequalities 


II {B n - B)X | < | (B n - B){X P k X )| + \\(B n - B)P k X\\ 

< (sup ||B n || + ||5||)||X - P k X 1 + || (B n B)P k X ||. 

n 

We know that ||A — P*,X|| -A 0, while || (B n — B)P k X\\ —> 0 because PkX is finitely 
non-null for any fixed k. Thus, we can make both summands of the last line arbitrarily 
small by choosing k and n big enough. This implies that || (B n — B)X || —>■ 0, proving 
that B is a strong limit of B n . □ 

The inductive proof of the existence of GCF given at the beginning of this section 
also shows that it is unique. On the contrary, the proof of Proposition [Aj] suggests that 
a positive definite Hermitian matrix can have multiple reversed CF, at least so many 
as limit points has the sequence A n arising from the CF, M n = A+A n , of the leading 
submatrices M n . The examples presented in the previous sections prove that this is 
indeed the case, and reversed GCF are in general not unique. 

A more abstract proof of the uniqueness of a GCF sheds light on the lack of unique¬ 
ness for reversed factorizations and its relation with the algebra of infinite matrices. 
Suppose that 

A\EiAf = A 2 E 2 A 2 , Aj e AT, Ei e ST. 

Using the associativity laws of Proposition 12.31 we get EiAf{Af)~ l = Af 1 A 2 E 2 . The 
left-hand side of this equality is upper triangular, while the right-hand side is lower 
triangular, so both must be diagonal. Thus, A+(AJ) _1 = D = Af 1 A 2 is a positive 
diagonal matrix and Ei = E 2 . Taking care of the associativity again with Proposi- 
tion 12.31 we find that D + D = (Af (Af)~ l ) + (Af 1 A 2 ) = /, which implies that D = I and 
A 2 = A 1 . 

The above proof of the uniqueness of GCF works equally well for finite or infinite ma¬ 
trices thanks to the associativity of the involved operations, ensured by Proposition 12.31 
in the case of infinite matrices. A similar proof of uniqueness also works for reversed 
GCF of finite matrices, but fails in the case of infinite matrices because Proposition 12.31 
does not guarantee the associativity of the required operations. To show this, assume 
that 

Af E\A\ = Af E 2 A 2 , A, e AT , Ei e TX , 
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where A are band matrices so that the products are admissible. Proposition 12.31 allows 
us to deduce that 

AtE l = (A+E^Ai 1 = (A+E 2 A 2 )Ai 1 = A+(E 2 A 2 A^). 

However, when trying to move to the left the factor AJ we only can conclude that 

(A+)~ 1 A+E 1 = (H+)- 1 (H+ J B 1 ) = {A+)~ l A+(E 2 A 2 A^), 

but the right-hand side cannot be further simplified using Proposition 12.31 because 
(A]") -1 is upper triangular and A 7 1 is lower triangular. Thus, the lack of associa¬ 
tivity for products of infinite matrices is the algebraic reason for the lack of uniqueness 
of reversed GCF. This problem does not appear for finite matrices, whose reversed GCF 
are unique. 

If C is a CMV matrix and £ is a Hermitian Laurent polynomial with N z zeros, then 
£(C) is a Hermitian 2 N z + 1-diagonal matrix with bounded entries due to the unitarity 
of C. Besides, the lower and upper bands of £(C) have no null entries, as follows from 
the zig-zag shape (J9J) of C. Therefore, all the previous results apply to the matrices £(C) 
whose CF and reversed CF are involved in Darboux for CMV. 
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